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Unit 6 


Truancy 


Introduction 


Introduction 


In Unit 6 we address the following question: 
How often do pupils truant? 


The topic of truancy has been chosen for this unit because it is an example 
that can be understood without technical background and which enables 
key statistical ideas to be illustrated and generalised. Statistics can enable 
informed discussion and decision-making to take place — decisions should 
be based on the careful analysis of reliable data. This is a module in 
statistics, not in education, but an aim in this unit and Units 7 to 9 is to 
demonstrate that statistics has an important role to play in the 
interpretation of data on education. 


The emphasis in this unit is on using statistical techniques to reach 
conclusions; however, background factors which may account for the 
conclusions are also discussed. This will give you the opportunity to think 
about the issues involved, perhaps with particular reference to your own, 
or your friends’, experiences. 














LE > 


Research shows that while the number of surfers 
is somewhat constant over the year, there is a 
sudden increase in casual bathers over summer..... 


It is worth mentioning that approaches to education in the UK change 
regularly. This module was written in 2012-13, and if you are studying 
M140 some years later, the system may have a different structure, and 
different issues may be important. However, the same statistical 
techniques will be valid, and many of the questions will still be relevant. 
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Schools in England 


The school system in England appears to be in a continual state of 
change. In the 1960s and 1970s, most grammar schools and secondary 
modern schools were replaced by comprehensive schools. Some City 
Technology Colleges started in the late 1980s and 1990s — they 
typically specialised in technology, science and mathematics, and 
often forged close links with businesses and industry. 


More recently, many schools have been changing their status to 
become academies, thus gaining greater independence, particularly in 
respect of budget control. In 2010 there were only 200 academies, but 
the number had risen to 1635 by March 2012. Also, free schools were 
introduced in 2011 as an extension of the Academies Programme. 
These schools are funded by the taxpayer, but they are not controlled 
by the local authority and may be set up by parents, teachers, 
charities and businesses. 


Unit 6, along with Units 7, 8 and 9, is a little more mathematical than 
earlier units. Do not worry if you find them difficult. The important thing 
is to ensure that you understand the results and how to apply them. In 
some cases, explanations are given of how results are derived. These are 
provided as you may find them interesting and they could increase your 
understanding. However, you will not be expected to reproduce these 
explanations in detail. 


This unit contains six sections. In Section 1, we consider what we mean by 
the question How often do pupils truant?. How are we going to measure 
truancy? Which pupils are we talking about? We shall narrow the 
question down until we have a more specific one that we are able to 
answer. In Sections 2 and 3 we introduce the concept of probability, which 
is fundamental to the ideas of statistical inference. In Sections 4 and 5 we 
describe a strategy for drawing inferences about a population from a 
sample, and introduce a particular method, known as the sign test, that 
follows this strategy. Section 6 directs you to the Computer Book, where 
you will use Minitab to calculate certain probabilities and perform a sign 
test. 


The unit introduces a lot of new concepts and ideas, some of which you 
may find difficult at first. However, you will find that these concepts occur 
many times in the remainder of the module, so you will have plenty of 
opportunity to become familiar with them. 


Writing down numbers 


In doing calculations, you have been encouraged to retain full 
accuracy in intermediate steps, perhaps writing down numbers in full 
to demonstrate that you are doing so. This is laborious and makes 
errors in copying down a number more likely. In the remaining units 
of this module, you need not do so. As far as you can, you should still 
use your calculator memory to store numbers to the calculator’s 
accuracy, but you need not write down all numbers in full (usually 
about five significant figures will be enough, enabling you to focus on 
the most important part of a number). 


1 Clarifying the question 


In Section 1 we consider what is meant by the question 
How often do pupils truant? 


Notice that this question refers implicitly to whole populations: for 
example, all schools in a particular area. We are usually not directly 
interested in how the children in one particular school behaved. However, 
it is often impossible, or at least not feasible, to collect data from the 
whole population. Instead we select a random sample of data. It might be 
a random sample of schools or of children. The sample is analysed by the 
methods we learned in earlier units, and we then need to decide how the 
results obtained from the sample apply to the whole population. 


Statistical inference makes inferences about a population on the 
basis of data drawn from that population. 


The above question about truancy may well have arisen from more general 
questions, such as: 


Why do some pupils learn very little? Are we using good ways of 

teaching? Does the quality of my child’s education depend on where 

I live? 
However, these latter questions can only be tackled if they are first made 
more precise. Hence, rather than simply posing a question, we will often 
need to clarify it, and we may need to clarify it more than once as we 
learn more about the problem. In earlier units we used the modelling 
diagram shown in Figure 1 as a framework for how we explore and 
summarise batches of data. 


1 Clarifying the question 


Random sampling was discussed 
in Subsection 1.2 of Unit 4. 
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Figure 1 Modelling diagram 

From now on we shall use a slightly modified form of this diagram in which 
e the differing roles of populations and samples are identified 

e the first box is changed from pose question to clarify question. 


The modified modelling diagram is given in Figure 2. 
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Figure 2 Modified modelling diagram 


The processes involved in statistical inference are very important. While 
we are concentrating on them, we will be concerned particularly with 
stages 3 and 4 of the modelling diagram. However, this does not mean that 
we can ignore the other stages. It is always important to ensure that the 
question under discussion is defined carefully and precisely. Also, although 
in this unit we shall not be particularly concerned with how the data were 
collected, we do need to know the form of the data, as this affects the way 
the sample is analysed. 


1 Clarifying the question 


1.1 The question to be clarified 


As we saw in earlier units, statistics is good at answering questions that 
require a numerical answer. However, the question for this unit is a very 
vague question. For example, we might be interested in how much 
particular children truant, or we might want to compare truancy at 
different schools. 


First, suppose we were looking at particular children. 


Activity 1 Factors affecting a child's truancy 


Spend a few minutes thinking about what factors might affect how much a 
child plays truant. Then write down three factors that you think might be 
relevant. 


Suppose a child psychologist is helping a particular child with a truancy 
problem. The psychologist would want to know the child’s attendance 
record and factors about the child’s circumstances that can influence 
truancy. The psychologist would then consider these factors and see if any 
pattern from the attendance record supported a given factor. 


The same approach is followed if you move from considering individual 
truancy to truancy associated with different schools. We shall concentrate 
on looking at patterns with regard to schools, not individual children. 


There are many different schools, and the amount of truancy will vary 
greatly. One of the interesting questions is whether different types of 
school have different amounts of truancy. 


Activity 2 Factors affecting truancy in a school 


Write down three factors that might affect the amount of truancy in a 
school. 


Age of children is one of the most important factors in truancy figures. 
There is much less truancy at primary schools than at secondary schools. 
Young children are more likely to be taken to school by their parents, and 
also, since they are usually with the same class teacher all the time, 
truancy would be more easily noticed and could be followed up more 
quickly. We shall concentrate on secondary schools. 


As you saw in the solution to Activity 2, there are still many factors that 
may affect truancy rate even after we have allowed for age to some extent 
by looking only at secondary schools. They include type of school, location 
of school and size of school, and there are also other factors, such as the 
attitude of the teachers, which are more difficult to measure. We shall look 
at several of these factors in the course of the unit, but we shall start with 
size of school. 


Is the truancy rate in large schools the same as the overall rate? A definition of ‘truancy rate’ is 
given in Subsection 1.2. 
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A sampling frame was defined in 
Subsection 4.6 of Unit 4 as a list 
of all individuals in the target 
population. 


To simplify the problem further, and to try to consider a fairly 
homogeneous group of schools, we shall look at large schools in the East of 
England region and compare them with all schools in the East of England. 
It would not be sensible to compare large schools in the East of England 
with the national average truancy rate, as the East of England has a much 
lower truancy rate than, for example, Inner London. If we want to 
investigate the effect of size of school, we want the schools to be alike in 
other respects as much as possible, and location is one way of achieving 
this. So we shall ask the following question. 


Do large secondary schools in the East of England have the same 
truancy rate as all secondary schools in the East of England? 


Changes in pupil performance — up or down? 


Pupils in the UK are steadily doing better when compared with their 
predecessors, while simultaneously doing worse when compared with 
the rest of the world. On the one hand, in 2011 the overall pass rate 
of A levels (qualifications typically used for university entrance in 
England, Wales and Northern Ireland) increased for the 29th 
consecutive year and more than a quarter of entries achieved the 
highest grade. Also in 2011, pass rates for Highers (qualifications 
typically used for university entrance in Scotland) hit a record high. 
On the other hand, Britain’s position in international education 
ratings dropped in most subjects after 2000. 


The OECD (Organisation for Economic Co-operation and 
Development) publishes reports on economic and social factors in its 
member states, including school performance league tables. Between 
2000 and 2006, the UK fell from 4th in the world to 14th in science, 
from 7th to 17th in literacy, and from 8th to 24th in mathematics. In 
part, no doubt, this reflects an increasing importance that other 
countries are placing on education. As Arne Duncan, the United 
States Secretary of Education, noted in a speech to UNESCO (United 
Nations Educational, Scientific and Cultural Organisation), ‘... ina 
knowledge economy, education is the new currency by which nations 
maintain economic competitiveness and global prosperity.’ 


1.2 Analysing the data 


We have now decided on a specific question to investigate and we need to 
collect some data. This unit is mainly about analysing data that have 
already been collected, but it is worth spending a little time thinking 
about exactly what data should be collected. We can assume that we have 
a sampling frame consisting of all state-funded secondary schools in the 
East of England and the number of pupils they have. We can therefore 
pick out large schools, and we have arbitrarily defined these to be schools 
with 1000 or more pupils. We can then select a random sample of these 
schools. A sample size of 12 has been chosen. 


We now want a single number to summarise the amount of truancy for 
each of the 12 schools. First we must consider what we mean by truancy. 
If a child skips school to go to the shops, then they are playing truant, 
while if they miss school because they are ill in bed, then they are not. 


Activity 3 What is truancy? 


Write down three reasons that a child might miss school — one that is 
definitely truancy, one that is definitely not truancy, and one that might or 
might not be truancy, depending on circumstances. 








A school in Colorado photographed in 1915 during the season to harvest 
beet — only five pupils are at school while another thirty-five are absent 
because they are helping with the beet work. 


A clear definition of truancy is needed if we are to gather truancy data for 
the different schools. The definition must take account of what data can be 
gathered, otherwise the definition may not be useful. In the next activity 
you are asked to think about how data related to truancy might be 
collected and used. 


Activity 4 Measures of truancy 


Think of two possible ways in which data on truancy in a particular school 
could be collected and truancy in the school measured. They should be 
feasible methods which will not occupy too much of the teachers’ time. 


1 Clarifying the question 
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This figure of 0.98% is a 
population median; see 
Subsection 3.2 of Unit 4. 
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When gathering data, precise definitions are needed. Hence the UK 
government collects data, not on truancy, but on ‘unauthorised absence 
from school’. An unauthorised absence is absence without permission from 
a teacher or other authorised representative of the school. Records are 
kept of when permission for absence has been given (which would be 
retrospectively in the case of illness), so unauthorised absence is a 
well-defined, documented quantity. It is clearly closely related to truancy. 
Indeed, when the government publishes statistics on unauthorised absences 
from school, television and newspapers refer to them as truancy rates. 
We shall do the same. 


Truancy rate 


One of the statistics on schools that the government publishes is the 
unauthorised absence rate, and we will adopt this as our truancy 
rate. 


e A pupil’s truancy rate is the proportion of school half-days that 
the pupil was absent without authorisation. 


e A school’s truancy rate is the average truancy rate of its pupils. 


You may well think this is not an ideal measure, and we shall return to this 
point in Subsection 5.3, but at present we shall concentrate on the analysis. 


The module team has used data that were already published. The data 
give truancy rates over the first two terms of the 2010/2011 school year. 
The percentage truancy rates in a sample of 12 large schools in the East of 
England were as follows. (Note that from here on, ‘school’ means 
‘secondary school’, unless otherwise qualified.) 


Table 1 Truancy rates (%) in 12 large schools in the East of England 
0.83 1.09 1.84 1.88 1.52 2.78 0.31 1.06 2.90 1.19 1.44 0.82 


(Data source: Department for Education (2011) Pupil absence in schools in 
England, autumn term 2010 and spring term 2011) 


The median truancy rate for all secondary schools in the East of England 
during the first two terms of 2010/2011 is known to be 0.98%. 


We can now move on to stage 3 and then stage 4 of the modelling diagram. 
We can analyse our sample and then try to interpret our results in terms 
of the whole population. This last step is known as statistical inference. 


Statistical inference 


This is a process of inferring back from the sample to the population. 
Somewhat paradoxically, the best approach to making inferences 
about populations from a sample is to consider what samples are 
likely to be obtained from a population. 


1 Clarifying the question 


To carry out statistical inference successfully, it is important that we have 
selected a random sample. 





‘The population of hula hoops tells you 
what Rind of hula hoops will be iw your sample. 
But the sample of hula hoops ts used to infer 
things about the population. 

Isw’t that circular Logic?’ 


Activity 5 Median truancy rate of sample of 12 large schools +a 


(a) Find the median truancy rate of the sample of 12 schools. 


(b) Can you infer back from the sample to the population of all the East 
of England schools? In particular, is it possible to infer that the 
population median of all large schools in the East of England is, or is 
not, equal to 0.98%? 


We do not expect you to give a definitive answer to part (b). Just 
think a little about the questions asked. 


The sample median rate of 1.32 is larger than 0.98 but not by an enormous 
amount. If we had chosen a different random sample, the median would 
probably have been different. Here is another random sample of size 12 of 
truancy rates in large schools in the East of England. This time the values 
have been written in ascending order: 


0.59, 0.67, 0.68, 0.94, 1.35, 1.38, 1.48, 1.54, 1.61, 1.86, 1.89, 1.99. 
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4. Interpret 


School 
below 
median 


A 
OO t 


12 


School 
above 


In this case the sample median is 5 (1.38 + 1.48) = 1.43. This is also bigger 


than 0.98 and a little larger than the sample median found in Activity 5. 
We should not infer back simply by looking at the sample and guessing. 

We need a systematic method, and the development of such a method is 
the main purpose of this unit. 


We shall look at the data of Table 1 in a different way. Instead of 
considering the sample median, we shall take each value in turn and record 
whether it is larger or smaller than 0.98. The first value of 0.83 is below 
0.98, so we shall record that as a negative difference, or |—]. The value 1.09 
lies above 0.98, so we record it as [+]. Working through the whole sample, 
we obtain: 


0.83 1.09 1.84 1.88 1.52 2.78 0.31 1.06 2.90 1.19 1.44 0.82 
HI ‘Pele che) E pe M B A H dna) bet H 


Altogether, we have nine values above 0.98 and three values below it. 


In Section 4, we are going to assume that the median percentage truancy 
rate for large schools in the East of England is indeed 0.98. We are then 
going to try to answer the question: 


How likely is it that a random sample of size 12 would contain nine 
values above and three values below the assumed population median? 


To answer this question, we need a way of making an informed judgement. 


If the median truancy rate for large schools were 0.98, it would not be 
surprising if there were six values above 0.98 and six values below it. This 
is the result that seems most likely, since the definition of a population 
median is that half the population values lie above it and half lie below it. 
We should expect a representative sample to be similar to the population. 
Also, after our work with samples in Unit 4, we should not be very 
surprised to find seven values above the median and five below it (or the 
other way round). 


However, it would seem surprising if all 12 values were either above or 
below the population median. So if all 12 were greater than 0.98 (the 
assumed population median), we would suspect that the population 
median for large schools was larger than 0.98. 


This suggests that some events are more likely to occur than others. The 
next activity suggests you think further along these lines. 


1 Clarifying the question 


Activity © Which events are more likely to happen? 


Look at the following list of events and rank them in order of likeliness to 
occur, starting with the one you think most likely. 


. Your new colleague at work has the same birthday as you. 
. Two out of a group of ten people have the same birthday. 
. The sun will rise tomorrow. 

. The sun will shine tomorrow. 


. You will win the jackpot in the National Lottery next week. 





J EI awe 


. You toss a coin and it lands tails up. 


G. You throw a die and it shows a six. (Remember ‘die’ is the singular 
form of ‘dice’.) 


H. A member of a hockey team is 150 years old. 


Activity 6 has demonstrated that some events are certain to occur, others 
are impossible, and the likelihoods of occurrence of others are somewhere 
between these two extremes. In the next two sections, we shall introduce 
the concept of probability, which is a method of associating a numerical 
measure with the likelihood of occurrence of a particular event. 


We shall then be able to use this concept to calculate how likely we are to 
observe nine values above and three values below the population median in 
a random sample of size 12. Then we shall be in a better position to use 
the sample to make inferences about the population and decide whether or 
not the median truancy rate of the population of large secondary schools 
in the East of England during the first two terms of 2010/2011 is equal to 
0.98%. 
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Exercises on Section 1 





Exercise 1 Your view on the likely occurrence of more events 


Look at the following list of events and rank them in order of likeliness to 
occur, starting with the one you think most likely. 


. A husband and wife find they were born on the same day of the week. 
. England will win the next football World Cup. 
. A Brazilian team will win the next football European Cup. 


. Death and taxes. 
. You get exactly three heads when you toss a coin five times. 


. A chicken egg has two yolks. 


A 
B 
C 
D 
E. A mother’s first pregnancy results in twins. 
F 
G 
H. It snows in London on Christmas day. 

I: 


You will be struck by lightning next year. 


WHOA! WE SHOULD GET INSIDE! 


IT5 OKAY! LIGHTNING ONLY KILLS 
ABOUT H5 AMERICANS A YEAR, SO 
THE CHANCES OF DYING ARE ONLY 
ONE IN 7000000. LET'S GO ON! 





THE ANNUAL DEATH RATE AMONG PEOPLE 
WHO KNOW THAT STATISTIC IS ONE IN SIX. 
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2 Probability 


At the end of Section 1, we established it would be useful to associate a 
number with an event so that it provided a measure of the likelihood of 
that event. This number is called the probability of occurrence of the 
event. The concept of probability plays such an important role in statistics 
that we are going to spend two sections investigating its properties, before 
returning to our specific question of truancy. 





‘How do you want it -the crystal mumbo -jumbo or 
statistical probability?’ 


2.1 Measuring chance 


The easiest way of thinking about probability is to equate it to proportion: 
the probability that a particular event will happen is the proportion of the 
time that it is expected to happen. When we toss a fair (unbiased) coin, 
for example, there is a fifty-fifty chance that the coin will land ‘heads’ 
because half the time it should land ‘heads’ and half the time it should 
land sale That is, the proportion of time that the outcome should be 


heads is 5, so the probability that the outcome will be heads is z. 


(In practice, of course, you can only toss a coin a finite number of times, 
and it is very unlikely to land ‘heads’ exactly half the time. For example, if 
you toss it 600 times, then there is little chance that it will land ‘heads’ 
exactly 300 times. However, if you toss a coin an enormous number of 
times, the proportion of ‘heads’ should be very close to 2.) 


2 Probability 
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See Subsection 1.2 of Unit 4. 


16 


Similarly, if a die is fair, then each of its six sides is equally likely to be the 
outcome when it is rolled. Thus, for example, the proportion of rolls that 
1 


should result in a 4 is z, so the probability of rolling a 4 is ġ. 


Probability = Proportion 


You met the ideas of random selection and random sampling in Unit 4. 
With random sampling, each member of the population is equally likely to 
be included in the sample. In particular, if a person or item is picked at 
random from a population, each member of the population is equally likely 
to be the one that is picked. We shall use these ideas to begin our 
investigation of probability. 





Example 1 Student probabilities 


Table 2 gives the breakdown of male and female students in a small 
university, by their faculty. (A student belongs to exactly one faculty.) 


Table 2 Student population of a small university 
Science Arts Law Medicine Total 


Female 964 1532 87 206 2789 
Male 1638 1039 247 369 3293 


Total 2602 2571 334 575 6082 


Now suppose that a student is selected at random from the university. 
Using the notion that probability equates to proportion, various 
probabilities relating to the student can be calculated. For example, the 
table gives the number of students who are female (2789) and the total 
number of students (6082). Hence we can determine the probability that 
the selected student is female by calculating the proportion of students 
who are female, as follows. 


The probability of selecting a female student is 
number of female students 2789 
total number of students 6082 
Similarly, the probability of selecting an arts student is 


~ 0.459. 





number of arts students 2571 
total number of students 6082 
and the probability of selecting a male science student is 





~ 0.423, 


number of male science students _ 1638 


—_—. mm i  — = — ~ 0.269. 
total number of students 6082 meee 





You have now covered the material related to Screencast 1 for 
Unit 6 (see the M140 website). 


Activity 7 Student probabilities 


E+ 
Ta 


Giving your answers to three decimal places, what is the probability that a = 
student chosen at random from the university population in Table 2 is: 


(a) a male student? 
(b) a law student? 


(c) a female medical student? 


Whenever you perform a calculation, you should look to see if your answer 
seems sensible — that way, you can sometimes spot large calculation errors. 
In the last activity you would expect your probability for (a) to be large 
because a lot of the students are male, while the probabilities for (b) 

and (c) should be small because there are comparatively few law students 
or female medical students. 


Activity 8 More probabilities 


Suppose we have a population of men and women and select just one 
person at random. For each of the following cases, give the probability 
that a woman is selected. 


(a) The population of size 10 consists of 5 men and 5 women. 
b) The population of size 10 consists of 1 man and 9 women. 


c) The population of size 10 consists of 9 men and 1 woman. 


The population of size 100 consists of 99 men and 1 woman. 


( 

( 

(d) The population of size 10 consists of 10 men and 0 women. 
(e 

( 


) 
f) The population of size 100 consists of 0 men and 100 women. 


In part (d) of Activity 8, the probability of selecting a woman was 0. This 
particular population contained no women, so it was impossible to select a 
woman. Similarly in part (f), the probability of selecting a woman was 1, 
and in that situation, the population contained no men, so a woman was 
certain to be selected. 


If you look at the other probabilities we have calculated, you will see that 
they are all between 0 and 1. This is because they have all been calculated 
as the proportion of a population who possess a certain property — for 
example, being a science student or being a woman. The closer the 
probability is to 1, the more likely the event is; the closer the probability is 
to 0, the rarer the event is. If you ever calculate the probability of an event 
and find it is either greater than 1 or a negative number, then you know 
you have made a mistake somewhere. 
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This discussion can be summarised by the following three important 
properties of probability: 


e If an event is impossible, then its probability is 0. 
e Ifan event is certain, then its probability is 1. 


e Any event which is uncertain but not impossible has a 
probability that lies between 0 and 1. 


At this point it is convenient to introduce some notation which cuts down 
the number of words we have to write. In the last activity you looked at 
the probability of the event that a person selected at random from a 
population is a woman. If we let W stand for the event a woman is 
selected, then we shall write P(W) for the probability that event W 
occurs, or the probability that a woman is selected. So 


number of women in population 





PW) = 
oe total number in population 
Similarly, we might let L stand for the event that a student selected at 


random is a law student. Then 
number of law students in population 


P(L) = 
(2) total number in population 


This can be extended to a more general situation. 


Let E stand for the event of selecting a person or object with some 
particular property from a population, using random sampling. Then 
the probability of FE is given by 

number in population with particular property 


Pb) = 
2 total number in population 


2.2 Adding probabilities 


Probability gives a way of measuring how likely an event is to occur in 
random sampling. In the last subsection you learned that a probability is 
always greater than or equal to 0 and always less than or equal to 1. The 
following example and activity use data on truancy to help you become 
more familiar with the idea of probability. The same data is then used to 
explore another property of probability. 
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Example 2 Truancy in two schools 


Table 3 shows some invented data on truancy in two schools, A and B, We have invented the data so 
that contained 200 and 100 pupils, respectively. that the numbers are small and 


the calculations are easy to 


Table 3 Truancy numbers in two schools check. 


Number of Number of pupils Total numbers 
days absent School A School B 


0 to 4 108 42 150 
5 to 9 60 30 90 
10 to 19 26 19 45 
20 or more 6 9 15 
Total 200 100 300 


We shall use the table to answer the following questions. 


1. 


If a child is selected at random from these two schools, what is the 
probability that this child was absent through truancy for fewer than 
5 days? 

If a child is selected at random from these two schools, what is the 
probability that this child is at School A and was absent through 
truancy for fewer than 5 days? 


If a child is selected at random from School A, what is the probability 
that this child was absent through truancy for fewer than 5 days? 


Let T stand for the event that a child selected at random was absent 
through truancy for fewer than 5 days, and let A stand for the event that 
the child attends School A. 


1. 


Here the probability is P(T). Now 
total number of children absent for < 5 days 


PD) = [1—.MMMMMMMMMMlM The symbol < means ‘less than’. 
(7) total number of children Similarly. > means ‘greater 
= 150 =05 than’, < means ‘less than or 
300 os equal to’ and > means ‘greater 


So there is a probability of 0.5 that a child picked at random from than or equal to’. 


these two schools was absent through truancy for fewer than 5 days. 


Here the probability is that both events T and A occur. This is 

P(T and A), which is an extension of our notation for the probability 
of an event. (It means the probability that both T and A occur. In 
this case, the event ‘T and A’ occurs if a child is absent through 
truancy for fewer than 5 days and also attends School A.) From 
Table 3, we see that 108 children attended School A and were absent 
through truancy for fewer than 5 days. So 


total number of children satisfying both T and A 
P(T Fcc aha ae 
(pane) total number of children 
108 
— = 0.36. 

300 


So the probability that a child attends School A and is absent through 
truancy for fewer than 5 days is 0.36. 
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3. You have to be careful here to appreciate what probability is required 
and how it differs from part (b). Probability is giving us precise 
answers, so we have to be careful that we ask precise questions and 
that the questions are the ones we want to answer. Here we assume 
that the child is selected at random from those at School A, so 
School A provides the total population. Thus 


P(child from School A is absent for < 5 days) 


total number of children at School A absent for < 5 days 
total number of children at School A 
108 


= — = 0.54. 
200 


So the required probability is 0.54. 


Activity 9 Truancy in two schools 


(a) Find the probability of each of the following events, using the data in 
Table 3. 


e A child chosen at random from these two schools is absent 
through truancy for between 5 and 9 days. 


e A child chosen at random from these two schools is absent 
through truancy for between 5 and 9 days and attends School B. 


e =A child chosen at random from those attending School A is absent 
through truancy for between 10 and 19 days. 


e A child chosen at random from these two schools is absent 
through truancy for 10 or more days. 


(b) For the first two events in part (a), define letters to denote the 
appropriate events as in Example 2. Use these letters to write in P(E) 
notation the probabilities you calculated. 


We next consider when probabilities can be added together. In 

Activity 9(a) you calculated the probability that a child selected at 
random from the two schools is absent for 10 or more days. This 
probability is 0.20. Now absence for 10 or more days means either absence 
for 10 to 19 days or absence for 20 or more days; in the context of the 
question, it does not matter which of these two events actually occurs. 


We can easily find the probability of occurrence of each of these events: 


45 
P(child absent for 10 to 19 days) = aa 0.15, 


15 
P(child absent for 20 or more days) = aa 0.05. 


The sum of these two probabilities is 0.15 + 0.05 = 0.20, which is the 
probability that a child selected at random is absent for 10 or more days. 


So we have 


P(child absent for > 10 days) 


= P(child absent for 10 to 19 days) + P(child absent for > 20 days). 


This is clearly not a coincidence, since 


P(child absent for > 10 days) 
total number of children absent for > 10 days 


total number of children 


number absent for 10 to 19 days + number absent for > 20 days 
total number of children 
number absent for 10 to 19 days number absent for > 20 days 


total number of children total number of children 


= P(child absent for 10 to 19 days) + P(child absent for > 20 days). 


This example was about two events, either of which might occur. However, 
it is impossible that both events occur at the same time. A child cannot 
both be absent for 10 to 19 days and for 20 or more days. Any two events 
with this property are called mutually exclusive. (The two events ‘absent 
for 10 or more days’ and ‘absent for 20 or more days’ are not mutually 
exclusive. If a child were absent for 21 days, both of these events would 
occur.) 


Two events are said to be mutually exclusive if they cannot occur 
at the same time. More generally, any number of events are said to be 
mutually exclusive if no two of them can occur at the same time. 


Activity 10 Exclusive events? 


Which of the following pairs or sets of events are mutually exclusive? 


(a) A person’s blood type is measured and the events are: (i) it is blood 
type A, and (ii) it is blood type O. 


(b) A person is described and the events are: (i) the person has black hair, 
and (ii) the person has blue eyes. 


(c) Some dice are rolled and the events are: (i) 3 dice are rolled, (ii) each 
die gives the same number, and (iii) the sum of the numbers on the 
dice is 8. 


(d) A person’s blood type is measured and the events are: (i) it is blood 
type A, (ii) it is blood type O, and (iii) the person’s blood is rhesus 
positive. 
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Suppose A and B are two mutually exclusive events. Since they are 
mutually exclusive events, both of them cannot occur at the same time. 
We shall denote the probability that one of A and B occurs by P(A or B). 
This probability is given by the addition rule for mutually exclusive events. 


Addition rule for mutually exclusive events (the ‘or’ linkage) 
For any two mutually exclusive events A and B, 
P(A or B) = P(A) + P(B). 


This rule extends to more than two events when they are all mutually 
exclusive. For example, if A, B and C are mutually exclusive events, 
then 


P(A or B or C) = P(A) + P(B) + P(C). 


Activity 11 Adding two probabilities 


This activity relates to the data in Table 3 in Example 2, reproduced 
below for convenience. 


Table 4 ‘Truancy numbers in two schools 


Number of Number of pupils Total numbers 
days absent School A School B 


0 to 4 108 42 150 
5 to 9 60 30 90 
10 to 19 26 19 45 
20 or more 6 9 15 
Total 200 100 300 


(a) Suppose a child is selected at random from School A. Find the 
probabilities that this child is absent through truancy for 


e 0 to 4 days 
e 5 to 9 days 
e 0 to 9 days. 


(b) Verify that the addition rule for mutually exclusive events holds in this 
case. 


There is a particular application of the addition rule which gives us a 
useful result about probabilities. Suppose E is some event that may or 
may not occur at a certain trial. (In statistics, a sequence of observations 
or a sequence of tests is often called a set of trials.) For example, the trial 
might be the selection of one child from the population of the two schools, 
and E might be the event that the child is absent through truancy for 0 to 
4 days. Let F be the event that E does not occur. Then F is the event 


2 Probability 


that a child is absent for 5 or more days. The events & and F are called 
complementary, because only one of them can happen and together they 
complete the list of possibilities. 


As one of the events E or F must occur, P(E or F) = 1. (Think about 
this. Is it possible that neither E nor F occurs?) 


Hence, by the addition rule, 
P(E or F) = P(E) + P(F)=1. 
So P(F) = 1 — P(E). We express this rule in words, as follows. 


Probability rule for complementary events (the ‘not’ linkage) 
For any event EF, 


P(E does not occur) = 1 — P(E does occur). 


You have now covered the material related to Screencast 2 for 
Unit 6 (see the M140 website). 


@ 


Activity 12 A small university revisited 


xs 
| mS 


Table 2 (reproduced below as Table 5) gave the breakdown of male and 
female students in a small university, by their faculty. 


Table 5 Student population of a small university 
Science Arts Law Medicine Total 


Female 964 1532 87 206 2789 
Male 1638 1039 247 369 3293 


Total 2602 2571 334 575 6082 


Suppose that a student is selected at random. 


(a) Find the probability that the student is male. Hence determine the 
probability that the student is female. 


(b) What is the probability that the student is not a medical student? 


Activity 13 Occasional truant, or not 


ga 


In relation to the data in Table 3 (reproduced in Table 4), let G be the = 
event that a child selected at random from the two schools is absent 
through truancy for 5 to 9 days. 


(a) Describe the event ‘G does not occur’. 


(b) Use the above rule to evaluate: P(G does not occur). 
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Two-course set menu 


Í 


Soup 
Pâté 
Salad 


a 


Beef 
Pasta 


—— 





2.3 Multiplying probabilities 


We have seen that probabilities are added when we have the ‘or’ linkage, 
and want P(A or B). We next consider how to determine probabilities 
when we have an ‘and’ linkage, and want P(A and B). We use the notion 
that P(A and B) is the proportion of the time that A and B both happen. 





Example 3 Two-course lunch 


A restaurant offers a two-course set lunch. There are three choices for the 
first course — soup, paté or salad — and two choices for the second course — 
beef or pasta. The different meal-combinations are shown in Figure 3. 


l beef 
soup pasta 
beef 

paté = 
pasta 


Figure 3 Tree diagram for a two-course lunch 


The diagram in Figure 3 is referred to as a tree. Starting at the left of the 
figure, we can follow one of three lines — branches — to choose a first 
course (soup, paté or salad). From each first course we can follow one of 
two lines — sub-branches — to choose the second course (beef or pasta). 
Thus there are 3 x 2 = 6 different paths we can follow, corresponding to 
the six possible meal combinations: soup—beef, soup—pasta, paté—beef, 
paté—pasta, salad—beef and salad—pasta. 


Suppose, now, that we choose a first course at random and also choose the 
second course at random. Then each of these six possibilities is equally 
likely. Thus the proportion of time we choose, say, salad followed by beef 
would be one-sixth, so 


1 
P(salad and beef combination) = 7 


Notice that there is a choice of three first courses, so if the choice is made 
at random, 


1 
P(salad for first course) = 3" 
And, as there are two choices for the second course, 


1 
P(beef for second course) = z 


Consequently, in this example 


P(salad and beef combination) = P(salad) x P(beef). 





Extending Example 3 is helpful, so suppose that there are four choices for 
the first course — soup, salad, pâté and prawns — and five choices for the 
second course — beef, chicken, fish, pasta and quiche. Following similar 
reasoning to Example 3, there are 4 x 5 = 20 different meal combinations. 


Activity 14 Two-course meal +a 


(a) Draw a diagram similar to Figure 3 to show the 20 different meal 
combinations. 


(b) Suppose the soup and salad are both vegetarian options for the first 
course, and pasta and quiche are vegetarian options for the second 
course. Write down the different combinations of courses that a 
vegetarian could have. 


(c) If a first course and second course are selected at random, what is the 
probability that the combination is vegetarian? 


(d) Suppose a first course and second course are selected at random. 
e What is the probability that the first course is vegetarian? 
e What is the probability that the second course is vegetarian? 


Check that the multiplication of these answers gives the probability 
you found in (c). 


In Activity 14 you found that 


P(vegetarian first course and vegetarian second course) 


= P(vegetarian first course) x P(vegetarian second course). 
The reason is as follows: 


P(vegetarian first course and vegetarian second course) 
number of vegetarian meal combinations 


total number of meal combinations 


number of vegetarian first courses x number of vegetarian second courses 





total number of first courses x total number of second courses 
number of vegetarian first courses number of vegetarian second courses 


x ——————— ĖĖĖŮĖŮ 
total number of first courses total number of second courses 


= P(vegetarian first course) x P(vegetarian second course). 


To extend this result, the following activity concerns three-course meals. 
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Activity 15 Three-course meal 


Suppose the menu for a three-course meal has the same choices of first 
course and second course as in Activity 14 and it additionally has a third 
course with three options: pie, crumble or ice cream. 


(a) How many different three-course meal combinations are there? 
(b) Suppose the three courses are each selected at random. 


e What is the probability that the third course would be good with 
custard (i.e. pie or crumble)? 
e What is the probability that the first two courses are vegetarian 
and the third course is good with custard? 
(c) When the three courses are selected at random, check that 


P(vegetarian first course and vegetarian second course 


and third course is good with custard) 


= P(vegetarian first course) x P(vegetarian second course) 


x P(third course is good with custard). 


We now formulate in general terms the rule we have been using which 
enables us to multiply probabilities in appropriate circumstances. To do 
this we need the concept of ‘statistical independence’. 


Two events are said to be statistically independent if the 
occurrence of one has no effect on the likelihood of occurrence of the 
other. 


In the last activity, for example, we assumed that each course was chosen 
at random, so the choice made for one course had no influence on the 
choice made for any other course. Thus, for example, the two events 
‘choosing beef for the second course’ and ‘choosing crumble for the third 
course’ are statistically independent. 


Activity 16 Independent events? 


Which of the following pairs or sets of events are independent? 


(a) Picking a person at random where the events are: (i) the person is 
taller than average; (ii) the person is heavier than average. 


(b) Picking two people at random where the events are: (i) the first person 
is taller than average; (ii) the second person is heavier than average. 


(c) Picking a person at random where the events are: (i) the person is 
taller than average; (ii) the person was born on a Tuesday. 


(d) Tossing two coins where the events are: (i) the first coin lands ‘heads’; 
(ii) the second coin lands ‘heads’; (iii) both coins give the same 
outcome. 


(e) Drawing a card from an ordinary deck of playing cards where the 
events are: (i) the card is an ace; (ii) the card is a diamond. 


Let A and B be two statistically independent events, and let ‘A and B’ 
denote the event that both A and B occur. Then the probability that A 
and B both occur is given by the following rule. 


Multiplication rule for statistically independent events 
(the ‘and’ linkage) 


If A and B are statistically independent events, then the probability 
that A and B both occur is given by 


P(A and B) = P(A) x P(B). 


This rule extends to more than two events. For example, if A, B and 
C are statistically independent events, then the probability that they 
all occur is given by 


P(A and B and C) = P(A) x P(B) x P(C). 


You have now covered the material related to Screencast 3 for 
Unit 6 (see the M140 website). 


Earlier in this section, we determined probabilities when an individual was 
picked at random from a population. In practice, often we randomly 
sample a number of individuals or a number of items from a population. 
We shall use the addition rule and multiplication rule to look at 
probabilities for samples that do not consist of just one individual but of 
several. First we will look at probabilities for samples of size 2, before 
seeing how our results can be generalised to larger samples. 


When we introduced random samples in Unit 4, we saw how they could be 
selected using random number tables. We selected a random number to 
give us the first member of the sample, then a second, and so on. If the 
same random number appeared a second time, we rejected it, because the 
same individual cannot appear more than once in a random sample. For 
the moment, we shall relax this restriction, because it makes the 
probabilities easier to find. Thus, if an individual or item is selected for a 
random sample, it can be picked again at a second selection. Later in this 
section, we shall return to proper random sampling and show that in 
practice for large populations, it makes hardly any difference to the 
probabilities. 


We shall introduce the method of finding probabilities for a sample of 
size 2 by means of an example. 
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Example 4 Sample of two 


Suppose we have a population of size 10 which contains three women and 
seven men. We shall call the three women Ashia, Brenda, and Clare, and 
the seven men David, Ejike, Frank, Gavin, Harry, Ian and Joe, and refer to 
them all by their initials. Suppose we select a sample of size 2 from this 
population, remembering that we may select the same person twice. 


How many samples of size 2 are there altogether? We must differentiate 
between the person selected the first time and the person selected the 
second time. We might pick Joe first and Clare second; we could write this 
as (J,C). Alternatively, we might pick Clare first and Joe second; this is 
(C, J). Other samples include (E, F) and (A, A), as Ashia might be selected 
twice. There are 10 possibilities for the first member of the sample and 10 
possibilities for the second member. Altogether there are 10 x 10 = 100 
possible samples of size 2. 





Activity 17 Sample of two women 


(a) For the population introduced in Example 4, write down all the 
samples of size 2 that contain two women, using their initials A, B, C. 
How many are there? 


(b) How could you have found this number without writing down all the 
samples? 


So using the results from Example 4 and Activity 17, we can now write 
down the probability that our sample of size 2 from this population of 
ten people contains two women. 


Equating probability to proportion, we can write 


P(sample of size 2 contains two women) 
number of samples of size 2 containing two women 


total number of samples of size 2 
9 


= —~ = 0.09. 
100 


We might also obtain this probability by applying the multiplication rule. 
Now the probability that the first selection is a woman is 3/10 = 0.3, as is 
the probability that the second selection is a woman. So to obtain the 
probability that both selections are women, we multiply the probabilities 
together. We shall introduce some notation to describe this result. Let W 
denote the event that at any selection a woman is chosen, and let 2W 
denote the event that two women are selected in a sample of size 2. Then 


P(2W) = P(W) x P(W) =0.3 x 0.3 = 0.09. 


We shall continue to sample from the population of Ashia, Brenda, Clare, 
.., Joe. 


Activity 18 Sample of two men 


TS 


Suppose M denotes the event that at any selection a man is chosen, and — 
2M denotes the event that two men are selected in a sample of size 2. Use 
the multiplication rule to find P(2M). 


We can use both this multiplication rule and the addition rule from 
Subsection 2.2 to find the probability that a sample of size 2 contains one 
woman and one man. To obtain such a sample, we either select a woman 
first and a man second, or a man first and a woman second. These 
possibilities are mutually exclusive events, so we add the probabilities: 
P(sample of size 2 contains one woman and one man) 
= P(a woman is selected first and a man is selected second) 
+ P(a man is selected first and a woman is selected second). 
Now we know that at any selection, P(W) = 0.3 and P(M) = 0.7. So, just 
as with a sample of two women, we multiply the probabilities together to 
obtain each of the probabilities on the right-hand side. Writing P(1W) for 
the probability that a sample of size 2 contains one woman (and therefore 
one man), we obtain 
POW) = P(W) x P(M) + P(M) x P(W) 
= 0.3 x 0.7 + 0.7 x 0.3 
= 0.21 + 0.21 = 0.42. 
We can now check our results by using the fact that our sample is certain 
to contain either two women or two men, or one woman and one man; that 
is, 
P(2W) + P(2M) + P(IW) =1. 


Substituting the calculated values, we obtain 0.09 + 0.49 + 0.42 = 1, as 
required. 
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Activity 19 Sampling from 100 people 


A population contains 40 men and 60 women. A sample of size 2 is drawn 
in which the first person selected is also available for selection the second 
time. Find the probabilities for the different possible numbers of men in 
the sample, and check that the probabilities sum to 1. 


Earlier, we decided to introduce the calculation of probabilities for samples 
of size 2 by allowing the same individual to be selected twice. Now we shall 
consider probabilities for random samples of size 2, when any individual 
can appear only once in a sample. Let us start by looking again at the 
population of Example 4 (Ashia, Brenda, Clare, ..., Joe), which consists of 
three women and seven men. We can select the first member of the sample 
in 10 ways. Then only nine individuals remain, so we can select the second 
in 9 ways. Hence the total number of samples of size 2 is 10 x 9 = 90. 


Activity 20 Sampling two different women 


How many random samples of size 2 contain two different women? 
Remember that Ashia first and Brenda second is a different sample from 
Brenda first and Ashia second, and that now the same person cannot be 
selected twice. 


Using the result of Activity 20, we can find 


P(random sample of size 2 contains two women) 


6 1 
= POW) = 90 IB 0.067 (rounded to three decimal places). 


Similarly, the probability of there being two men in the sample is 


7x6 42 7 
—— = — = — X 0.467. 
90 90 15 meat 


So, by the addition rule and the probability rule for complementary events, 
the probability of one man and one woman is 


ji 7 7 
1—{—+—]=— ~ 0-467. 
(5 +5) To 
These probabilities — 0.067, 0.467, 0.467 — are different from those we found 
before when the same individual could be selected twice: 0.09, 0.49, 0.42. 


However, this is not a realistic situation. Real-life surveys do not deal with 
populations of size 10. Suppose our population contains 1000 people (still 
a fairly small population) of which 300 are women and 700 are men. 


Now the probability of selecting a woman is 300/1000, which is still 0.3. So 
if we can select the same person twice, the probability that a sample 
contains two women is 0.3 x 0.3 = 0.09, by the multiplication rule as 
before. For a random sample in which we cannot select the same person 
twice, 


P(random sample of size 2 contains two women) 

_ 300 x 299 

~ 1000 x 999 
This is essentially the same as for the simple case when we just multiply 
0.3 by 0.3. The probability of two men is (700 x 699)/(1000 x 999) = 0.490 
rounded to three decimal places, again the same as the simple case. This is 
because with a large population, the probability of selecting the same 
individual twice is so small that it can be ignored. 


= 0.090 (rounded to three decimal places). 


For the rest of this module we shall assume that populations are large 
enough for us to apply the multiplication rule, and that the probability of 
selecting an individual possessing some characteristic remains the same for 
each selection of a random sample. 


In this section, we have introduced the concept of probability and seen 
how to calculate probabilities for various events, including the composition 
of random samples of size 2. In the next section, we shall look at 
probabilities for larger random samples, and relate these to the question on 
truancy that we posed in Section 1. 
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Exercises on Section 2 





Exercise 2 Colouring probabilities 


The hair colour and eye colour of a random sample of 6800 German men 
are recorded in Table 6. 


Table 6 Hair colour and eye colour of 6800 men 


Hair colour 


Brown Black Fair Red | Total 


Eye Brown 438 288 115 16 857 
colour Grey or green 1387 746 946 53) 3132 
Blue 807 189 1768 47) 2811 


Total 2632 1223 2829 116 | 6800 








(Source: Goodman, L.A. and Kruskal, W.H. (1954) ‘Measures of association for 
cross classifications’, Journal of the American Statistical Association, vol. 49, 
no. 268, pp. 732-64) 


Suppose that one of the men in the sample is picked at random. What is 
the probability that the man: 


(a) has blue eyes? 

(b) has brown hair? 

(c) has blue eyes and brown hair? 
(d) has brown hair or black hair? 
( 


e) does not have black hair? 





Exercise 3 Saturday afternoon 


The probability that Sue will go to watch her favourite hockey team play 
hockey on Saturday is 0.3. The probability that her team will win is 0.4. 
What is the probability that Sue will watch her team play on Saturday 
and they will win? 





3 Probability distributions from random samples 


3 Probability distributions from 
random samples 


At the end of Section 1 we began to investigate the median truancy rate of 
large schools in the East of England by taking a random sample of the 
schools. In the sample we found that there were nine values above the 
(assumed) population median, and three values below. This led us to ask 
the following question. 


How likely is it that when selecting a random sample of size 12 from 
some population, the resulting data will have nine values above the 
(actual) population median and three below it? 


This is really a question about probabilities, and we can answer it by 
applying the rules for adding and multiplying probabilities that we studied 
in Section 2. Although the question refers specifically to nine values above 
and three values below the population median, our approach will enable us 
to calculate probabilities for all possible outcomes for any size of random 
sample. To this end, we need other results for counting the number of 
ways in which an event can occur. 


3.1 Counting combinations 


Example 5 Chairperson and secretary 


Suppose a club has ten members and we want to know the number of ways 
in which a chairperson and a secretary for the club could be chosen, if they 
cannot be the same person. If the chairperson is chosen first, then there 
are 10 choices of chairperson. There remain nine people from whom to 
choose the secretary, so the total number of ways of choosing a chairperson 
and secretary is 


10 x 9 = 90. 


If the club members were the ten people in Example 4 (Ashia, Brenda, 
Clare, ...), then the 90 choices would be (A, B), (B, A), (A, C) ete. 


Note that it makes no difference whether the chairperson or secretary is 
picked first. If the secretary were picked first, then there would be 

10 choices for the secretary, and nine people would remain from whom to 
pick the chairperson. Again the number of choices for secretary and 
chairperson would be 10 x 9 = 90. 
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The Programme Committee for the 2012 European Conference on Quality 
in Official Statistics going to lunch ... or trying to avoid being appointed 
as chairperson? 


Example 6 Chairperson, secretary and treasurer 


Suppose the club with ten members want a treasurer as well as a 
chairperson and secretary. If the chairperson is chosen first, then there are 
10 choices of chairperson. If the secretary is chosen next, then there are 

9 choices of secretary. There remain eight people from whom to choose the 
treasurer, so the total number of ways of choosing a chairperson, secretary 
and treasurer (if they must be different people) is 


10x 9 x 8 = 720. 





Activity 21 Chairperson, secretary, treasurer and vice-chairman 


Suppose the club with ten members want a chairperson, secretary, 
treasurer and also a vice-chairperson. If they must all be different people, 
in how many different ways can they be chosen? 


Generalising from these examples, we have the following result. 


Number of ways of choosing an ordered sample from a set of 
objects 


Suppose there are n objects to choose from. Then the number of ways 
of choosing x objects in a specified order is 


nx (n—-1)x---x (n-—a@+4+1). 


(There are x terms in n x (n — 1) x --- x (n — x + 1).) 


3 Probability distributions from random samples 


What we would actually like to count is the number of ways we can select, 
say, three people from ten when the order in which the people are selected 
does not matter. For example, if a committee of three people is to be 
formed and they will not have individual roles, then choosing A, then C 
and then D as the committee would be the same as choosing C, then A 
and then D. 


So, how many ways can we form a committee of three people from ten 
members? 


To tackle this question we can think of the task of choosing a chairperson, 
secretary and treasurer as a two-stage procedure: first we select a 
committee of three people, and then we allocate those three people the 
roles of chairperson/secretary/treasurer. Now the number of ways of 
allocating three roles to three people is 3 x 2 x 1, as there are three choices 
of which committee member to make chairperson, then two committee 
members left for choosing the secretary, and then the one remaining 
committee member becomes treasurer. Thus, for every choice of 
committee, there are 3 x 2 x 1 ways of choosing a chairperson, secretary 
and treasurer. Consequently, 


number of ways of choosing a chairperson, secretary and treasurer 


= 3x 2 x 1 x (number of ways of choosing a committee of three people). 


Also, from Example 6, we know that the total number of ways of choosing 
a chairperson, treasurer and secretary from ten people is 10 x 9 x 8. Thus 


10x9x8 


= 3x 2 x 1 x (number of ways of choosing a committee of three people). 


So the number of ways of choosing a committee of three people from ten 
members is 


10 
eT 
3x2x1 
Activity 22 Committee of four people +a 


Suppose a club has 12 members. 


(a) In how many ways can a chairperson, vice-chairperson, secretary and 
treasurer be chosen if no person can hold more than one role? 


(b) Suppose we have selected four people. In how many ways can they be 
allocated the roles of chairperson, vice-chairperson, secretary and 
treasurer? 


(c) Hence, what is the number of ways in which a committee of four 
people can be selected, if the order of selection does not matter? 


If we make a selection, and the order in which the people or items are 
selected does not matter, then the selection is referred to as a 
combination. If there are n people, we let "C, (read as ‘n choose x’) 
denote the number of ways of choosing a Combination of x people. 
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Thus !°C3 is the number of ways of choosing, from 10 members, a 
committee of size 3. Similarly, !°C¢ is the number of ways of choosing, 
from 15 objects, a collection of 6 objects if the order in which they are 
chosen is of no importance. We have that 


10x9x8 
3x2x«K1- 


Note that there are three terms in both the numerator and denominator of 
10C3. Also, 


NG. = 


15x 14x 13x 12x 11~x 10 
6x5x4x3x2x1 ” 
where now there are six terms in both the numerator and denominator. 


TOA = 





The following summarises the results we have obtained so far in this 
subsection. 


Number of ways of choosing an unordered sample from a set 
of objects 


Suppose there are n objects to choose from. Then the number of ways 
of choosing x objects if the order does not matter is 


_ number of choices of x objects if order does matter 
= 


n a A era SM Cees Eas pe ae ME ee, 
number of ways in which x objects can be ordered 


DOS @ We ae) 

T mx (=li l 
(There are x terms in both n x (n — 1) x --- x (n — x + 1) and 
g z (= I) x ee x T) 


For any value of n, "Co = 1 and "C,, = 1. This can be seen directly 
from the definition, because there is only one way that zero objects 
can be selected from n objects and only one way that n objects can 
be selected from n objects. 


3 Probability distributions from random samples 


Activity 23 Combinations 
Calculate 8C3, “Cs and 4C\. 





More combinations? (of clothes, that is) 


3.2 Probabilities of combinations 


Here we again suppose that we have a sample of observations from a 
population. How many of these observations are greater than the 
population median will partly be a matter of chance. In this section we 
aim to determine the probability that the number will be 1, the probability 
that it will be 2, the probability it will be 3, and so forth. This will answer 
the question posed at the start of Section 3: 


If we have a sample of size 12, what is the probability that exactly nine 
values will exceed the population median? 


When the size of the sample exceeds 5 or 6, we will need our results about 
counting combinations in order to determine the probabilities we want. We 
shall start, though, by considering the simplest possible case: a random 
sample of size 1. What is the probability that the selected value lies above 
the population median? We shall write this probability as P([+]), and we 
shall write P({—]) for the probability that the selected value lies below the 


population median. Using the definition from Section 2, we obtain 
P([4)) = number in population above median 
7 total number in population 


The definition of the population median is that half the population lies 
above it and half below it, so 


Similarly, P([-]) = 5. 
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Since the selected value must lie either above or below the median, one of 
We are ignoring the possibility these events must happen; that is, it must be the case that 
that the value is exactly equal to 
P+) + P({[-) = 1. 


the population median. 
Our calculated probabilities agree with this (5 + 5 = 1). 


Moving on to a sample of size 2, let ([+], [—-]) mean that the first 
observation was [+] and the second was [—]. Then 








P((-}, [-) = PH) x P(-) =} $= (H? 
P({-], H) = P(-)) x PIH) = (4)? = 4, 
PHH = PH) x PUD = (2)? = 3, 
PH, H) = PH) x PH) = G) = 4 
If we let P(a[+]) denote the probability that the number of [+] is x, then 
POH) = P(-],[-]) = 4 and PH) = PH, H) = 4, while 
POD = P(-L 4) + PH.) =444=3. 


= - as 
Exactly one of the events 0[+], 1[+] and 2|+] happens, so 
P(O[+]) + POH) +PEH) =44+44+4=1. 





A probability distribution gives the probability of each possible 
event that could occur. 


When the number of possible outcomes is small, a probability distribution 
is easily presented as a table or a bar chart. Figure 4 shows the probability 
distributions that were calculated above for a sample of size 1 and a 
sample of size 2. 















































Sample of size 1 
7o 
Number of [+]s, x 0 1 a 
a, 
P(2|+]) 23 3 
0 1 
g 
Sample of size 2 
Tl 
ceo 
Number of [+]s, x 0 1 2 8 
a, 
P(el+) 13 4 ad 
M l 2 
g 


Figure 4 Probability distributions for samples of size 1 and size 2 
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Activity 24 Probability distribution for a sample of size 3 


Ta 


T5 
xE 
Suppose a sample of three observations is to be taken from a population. —_ 
(a) For each of the four numbers 0, 1, 2, 3, determine the probabilities 

that the number of observations greater than the median will be the 

same as it. Check that these four probabilities sum to 1. 


(b) Present these values in a table and draw a bar chart of the probability 
distribution. 


Calculations to obtain the probability distribution get longer as the sample 
size increases. Hence, for samples larger than 3 we need a better method 
for finding these probabilities. Let us consider the case where the sample 





size is 5. 
Now the probability of the sequence [+], [—], [—], [+], [+] equals 
deg) Ue ag dg? Mane 1y5 
aX 9X 9X 9x9= (9). 
Similarly, the probability of the sequence [—], [—], [-], [+], [—] equals 





1 
2 


1 EFEO E 

xaxaxa šm) 

Indeed, the probability of any sequence that we specify will equal Ge. 
Consequently, 


P(number of [+]s equals z) 
= (number of sequences that have æ [+]s) x (dy. 

Now to get a sequence that has x [+]s, we must just choose x of the five 
observations to be [+], leaving the remainder of them to be [—]. For 
example, if x is 2 and we label the five observations A, B, C, D and E, 
then the two [+] values could be chosen as A and C, for example, or A and 
E, or D and B, and so on. Hence, the number of sequences that have x 
[+]s is "Cz, so 

P(Number of [+]s equals x) = °C, x (yp 


More generally, we have the following result. 


Suppose we take a random sample of size n. Then 


Iye 


is the probability that exactly x of these observations are greater than 
the population median. 


(The formula above is a special case of a probability distribution known as 
the ‘binomial distribution’.) 


You have now covered the material related to Screencast 4 for E 
Unit 6 (see the M140 website). a- 


39 


Unit 6 Truancy 


40 





Example 7 Probability distribution for a sample of size 5 
A random sample of size 5 is selected. 


1. What is the probability distribution for the number of values that lie 
above the population median? 


2. What is the probability that exactly four of the selected values lie 
above the population median? 


3. What is the probability that at least four values lie above the 
population median? 


For the first question, applying the formula gives: 














P(O[+]) =°Co x VP iy = 0.031 25 
ee A a 
iy’ 5 /1\ř 5 
5 
foes E ~x(-) =—=0.1562 
P(1[+]) ci x (5) > x (5) zz = 0-156 25, 
1\> 5x4 /1\° 10 
PH) =" x [=| = x |=] = — =0.3125, 
2 2x1 \2 32 
5 5 
1 1 
P(3[+]) = "C; x ARE a ce — © _ 9.3125, 
2 3x2x1  \2 32 
5 5 
4 ae gil 
P(4[+]) =°Cy x i en Ea = > = 9.15625, 
2 4x3x2x1l 2 32 
5 5 
1 5x4x3x2x1 1 1 
P54) = 5 | ee S E Ee 
EEI sx (5) epee” (5) 32 


2 


These form the probability distribution, and the following table displays it 
in a clear form. 
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Table 7 Probability distribution for a random sample of size 5 


Number of [+]s,z2 0 1 2 3 4 5 
1 5 10 10 5 1 
32 32 32 32 32 32 





P(a[+]) 


For the second question, we can now read off the first probability we 
require: 


P(4 = —. 
uH == 
The answer is quite satisfactory in this form. There is no need to write it 

as a decimal. If you do, it is reasonable to round it to three decimal places 


and write P(4[+]) ~ 0.156. 


The third question asks for the probability that at least four values in the 
sample lie above the population median. The only possibilities are four 
values or five values. As these are mutually exclusive events, we use the 
addition rule: 


3 Probability distributions from random samples 


P(at least 4[+]s) = P(4[+] or 5[+]) 





= P(4[+]) + P(S[+]) 
5 1 
= + 37 (from Table 7) 
ae 
32 
~ 0.188. 





Activity 25 Probability distribution for a sample of size 4 


A random sample of size 4 is selected. 


(a) Calculate the probability distribution for the number of values that lie 
above the population median. Check that the probabilities are all 
positive and add to 1. 


(b) What is the probability that: 


e two of the selected values lie above the population median and 
two lie below it? 


e all the selected values lie on the same side of the population 
median? 


In the question posed at the end of Section 1, we looked at data from a 
random sample of 12 large schools in the East of England and asked the 
question: 


How likely is it that a random sample of 12 would contain nine values 
above and three values below the assumed population median? 


We repeated the question at the start of Section 3. You are now able to 
answer it! 


Activity 20 Probability that three out of 12 are below the median 


Suppose we take a random sample of size 12. What is the probability that 
there are exactly 9 values above the median? 


Later we will need the complete probability distribution for the number of 
values that exceed the population median in a random sample of size 12. 
Determining this would be tedious using a calculator, so a computer would 
always be used. The probability distribution that would be obtained is 
given in the form of a diagram in Figure 5. The probabilities in the figure 
are rounded to three decimal places. 


| Pes 
"g 
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P(z[+]) 
©| 0.000 
=] 0.003 
w 0.016 
co D 0.054 
rr 0.12: 
a 0.193 


Figure 5 Probability distribution for a random sample of size 12 


We can read off from this diagram the probability of any specific outcome. 
In Section 4 we shall see how to use this information to make an inference 
about the population from the sample result. 


Exercises on Section 3 





Exercise 4 Flag signals 


A box contains seven flags, each of a different colour. A signal is made by 
flying three of the flags on a vertical flagpole. 


(a) How many different signals can be made? (Note that a signal depends 
on the order of the flags on the flagpole as well as their colour.) 


(b) When a signal is flying, how many different combinations of flags could 
be left in the box? 





Exercise 5 Tossing a coin five times 


What is the probability that you get exactly three heads when you toss a 
coin five times? 
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In Subsection 1.2 we introduced a measure of truancy: a pupil’s truancy 
rate is the proportion of days that the pupil was absent without 
authorisation, and a school’s truancy rate is the average truancy rate of its 
pupils. The median truancy rate for all secondary schools in the East of 
England was 0.98%, and we wondered how the median truancy rate for 
large secondary schools in the East of England compared with this. 


In a random sample of 12 large (secondary) schools in the East of England, 
9 had a truancy rate above 0.98%. This is more than you would expect if 
the median for large schools were truly 0.98% — in a sample of 12 you 
would get 6 values above the median, on average. The question is: 


Given these sample data, could we reasonably believe that the truancy 
rate in large schools in the East of England is 0.98%? 


The procedure of using probability to examine this type of question — 
where sample data are used to evaluate the credibility of a statement — is 
called a hypothesis test. We will first give the steps in a hypothesis test 
and then discuss them. 


Steps in a hypothesis test 


1. Make a statement about the population of interest (e.g. the 
median truancy rate of large schools in the East of England is 
0.98%). This is the hypothesis we wish to test. 


2. Under the assumption that the hypothesis is true, determine the 
probability distribution for all possible values of some sample 
statistic (e.g. determine the probability distribution for the 
number of large schools, out of 12, that will have a truancy rate 
above 0.98%). 


3. Now take the sample and ascertain how unlikely the observed 
value of the sample statistic is, on the basis of (1) and (2) (e.g. 
are we very unlikely to get a sample statistic as large as 9, that 
is, a sample of 12 schools in which nine or more schools have a 
truancy rate above 0.98%). 


4. Ifthe sample turns out to have a very unlikely value, then either: 
e avery unusual event has happened, or 


e the hypothesis suggested in step 1 is incorrect, in which case 
the sample has provided evidence, albeit in a negative way, 
that adds something to our beliefs about the population. 
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until proven guilty. 
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Usually the hypothesis stated in step 1 is something that we do not really 
believe and would like to disprove. This is similar to a trial in a law court, 
and, indeed, hypothesis tests have much in common with a law-trial: 


e Ina law-trial, the hypothesis is that the person in the dock is 
innocent. The prosecution do not really believe this, though, which is 
why the person is in the dock. 


e The evidence is examined: a witness identified the defendant as being 
at the scene of the crime; the defendant has no alibi for the time of the 
crime; a footprint matches one of the defendant’s shoes; etc. The 
fundamental question is: how likely are these occurrences if the 
defendant is innocent? 


e When the events are extremely unlikely to have arisen if the defendant 
is innocent, it is concluded that the assumption of innocence is wrong, 
and the defendant is found ‘guilty’; i.e. it is concluded that, beyond all 
reasonable doubt, the hypothesis of innocence is wrong. 


There have been cases where probabilities could be calculated under 
the assumption of a defendant’s innocence. This raises the question: 
in a law trial, when is a probability sufficiently small to equate to 
‘beyond all reasonable doubt’? 


4.1 Tackling the problem 


Much of the content in Units 6 to 8 is concerned with making a hypothesis 
about a population, and then analysing samples to test whether the 
hypothesis is reasonable. We shall meet several different hypothesis tests; 
the one we introduce here is called the sign test because it is based on the 
number of [+]s and [—]s. 


Before giving a formal test, the following activity asks you to make your 
own intuitive judgement about whether sample data indicate that a 
hypothesis is wrong. The hypothesis is that the median truancy rate of 
large schools in the East of England is 0.98. The data are the truancy 
rates in a random sample of 12 such schools. 


Activity 27 What is your judgement? 


For each of the following instances of random samples of 12 schools, would 
you think the hypothesis ‘The median truancy rate of large schools in the 
East of England is 0.98’ is (i) quite possibly true, (ii) probably wrong, or 
(iii) almost certainly wrong? 


(a) All 12 values are above the hypothesised median. 
(b) There is 1 value above and 11 values below the hypothesised median. 
( 


c) There are 6 values above and 6 values below the hypothesised median. 
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(d) There are 7 values above and 5 values below the hypothesised median. 


(e) There are 9 values above and 3 values below the hypothesised median. 


The solution to Activity 27 suggests that if all, or nearly all, the values lie 
on the same side of the assumed median, as in (a) and (b), then we should 
conclude that the hypothesis is almost certainly wrong. If the values are 
fairly evenly distributed on both sides of the assumed median, as in (c) 
and (d), then we should conclude that the hypothesis is quite possibly true. 
When the outcome is between these two, as in (e), then we are not sure. 


Clearly sample data can cast doubt on the truth of a hypothesis. But how 
much doubt? Just using intuition to judge this is unsatisfactory. Given 
sample data, we need an objective way of quantifying the evidence that a 
hypothesis is wrong. In hypothesis testing, we use the probability 
distribution of all possible outcomes to measure the evidence that a 
hypothesis is wrong. 


Figure 5 gave the probability of each possible outcome when we take a 
sample of size 12 and the probability of [+] for each item is 5: Figure 5 is 
reproduced here as Figure 6. 
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Figure 6 Probability distribution for a random sample of size 12 
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In a court of law, an unequivocal decision must be made as to whether or 
not a defendant is guilty. Here, for the moment, we shall assume that we 
must also decide one way or the other on whether or not to reject the 
hypothesis about the median. Thus we want a rule — a way of deciding — 
that tells us how extreme the outcome has to be for us to reject the 
hypothesis. In Figure 6 you can see that all the individual probabilities are 
quite small; even the probability of getting 6/+]s and 6[—]s is only 0.226 
(about z, so between 1-in-4 and 1-in-5 chance). However, the extreme 
probabilities are much smaller than this. P(O[+]) and P(12[+]) are so 
small they appear as 0.000. 


So if the hypothesis is true, then the outcome 0[+] is extremely unlikely to 
occur, as is the outcome 12{+]. This supports our intuitive feeling that if 
we did select a random sample with either 0[+] or 12[+], then we should 
conclude that the hypothesis is almost certainly wrong. So one possible 
rule would be to reject the hypothesis if either of these extreme outcomes 
occurs. However, this rule is erring on the cautious side; it does not cover 
(b) in Activity 27, which is also very extreme. Another possible rule would 
be to reject the hypothesis if the sample contains 0[+], 1[+], 11[+] or 12[+]. 


Activity 28 Probabilistic judgement 


(a) Use the probability distribution of Figure 6 to find the probability that 
a random sample of size 12 has one of the outcomes 0[+], 1[+], 11/+], 
12[+]. 


(b) On the basis of your result in part (a), do you think it is reasonable to 
reject the hypothesis if a sample is selected with one of these 
outcomes? 


The probability of getting one of these four extreme outcomes is only 0.006 
— that is, 6 chances in 1000 or about 1 in 170. Since this probability is also 
very small, perhaps we are still being over-cautious if we decide to reject 
the hypothesis only if one of these four outcomes occurs. It might be 
sensible to incorporate the next most extreme outcomes, 2{+] and 10[+], 
into our rejection rule, and possibly also 3[+] and 9[+]. 


When you first come across the idea, you might think it sensible to reject 
the hypothesis only if there is a very small probability of the observed 
result. For example, you might think it a good idea to reject if all 12 
observations are on the same side of the median. However, this means that 
we would not reject the hypothesis if 11 observations were above the 
median and only one below it. In such a case, it is much more likely that 
the hypothesis is wrong and the median is in fact larger than 0.98%. It is a 
matter of compromise; if we are over-cautious, we will often not reject a 
hypothesis which is false. We have to remember that we are working with 
probabilities when we look at samples. We cannot draw conclusions with 
certainty; there will always be some doubt. 


How far should we continue? We shall go on adding extreme values into 
our rejection rule until their combined probability is too large for us to feel 
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justified in rejecting the hypothesis. What is an acceptable value for this 
probability? This depends on many factors like the importance of the 
decision, and we shall discuss this in later units. For the present, we shall 
choose a probability of 0.05 or 5%. This value is used very frequently in 
many practical situations, and is universally accepted as a reasonable 
choice. 


Significance levels 
We refer to 5% as the significance level of our hypothesis test. 


If a sample is selected whose values are one of the 5% most extreme 
outcomes that might occur if the hypothesis were true, then we reject 
the hypothesis at the 5% significance level. 


Note that this statement does not say ‘we reject the hypothesis with 
absolute certainty’. Rather, the statement gives the probability that 
rejecting the hypothesis would be the wrong decision. 


Use of the 5% significance level is illustrated in Figure 7; this is a general 
picture for any size of sample. The actual probabilities will be different for 
different sample sizes, but the general pattern is the same. The horizontal 
axis represents the number g of [+]s in the sample, and the vertical axis 
represents the probability of obtaining different values of x. The 5% most 
extreme values are shaded. 


P(a[+}) 

















The probabilities in the 
shaded areas add up to 5 or 0.05 





Figure 7 Tail areas adding to 5% 


We shall now apply the rule to our random sample of size 12. First, notice 
that the probability distribution is symmetric, so we can start by 
considering the left-hand side only, as indicated by the shading in Figure 8. 
This means finding the outcomes in the left-hand tail whose probabilities 
add up to 5 = 0.025 or 25%. Because the distribution is symmetric, there 
will be a corresponding 0.025 of probability in the right-hand tail, and the 
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Notice that we only have to add 
the next probability to the 
previous total. 
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two taken together will add up to the required 0.05 or 5%, as shown in 
Figure 8. 












The probabilities in this 
shaded area add up to aa or 0.025 








Figure 8 Symmetric distribution 


Using the probability distribution of Figure 6 and the addition rule, we 
find the following probabilities for a random sample of size 12: 

P(0[+]) = 0.000, 

P(O[+] or 1[+]) = P(0[+]) + P(1[4+]) ~ 0.000 + 0.003 = 0.003, 

P(O[+] or 1[+] or 2[+]) ~ 0.003 + 0.016 = 0.019, 

P(O[+] or 1[+] or 2[+] or 3[+]) ~ 0.019 + 0.054 = 0.073. 
The probability of 0, 1 or 2 [+]s is 0.019 which is less than 0.025, whereas 
if we include 3[+]s the probability is greater than 0.025; neither is exactly 
equal to 0.025. We shall err on the side of caution and take the value 


which is less than 0.025. So we shall reject the hypothesis if a sample is 
selected with one of the outcomes 0[+], 1[+] or 2/+]. 





We shall also reject the hypothesis at the 5% significance level if the 
outcome is one of the corresponding extremes on the right-hand side, that 
is 10[+], 11[+] or 12[+]. To emphasise the symmetry, it is easier to think of 
these outcomes as 2[—], 1[—] and O[—]. So for a sample of size 12, we shall 
reject the hypothesis if the number of values on one side of the assumed 
median is 2 or fewer. If our hypothesis is true, the probability that there 
are two or fewer values on one side of the median is 2 x 0.019 = 0.038. The 
probability that there are three or fewer values on one side of the median 
is 2 x 0.073 = 0.146 (about 2). If we decided to reject the hypothesis when 
we observed three or fewer values on one side of the median, we would have 
a one in seven chance of rejecting the hypothesis when it was correct. Most 
people consider this an unacceptably high chance, and so we decide to 
reject the hypothesis if two or fewer values are on one side of the median. 


4 Testing hypotheses | 


The number 2 is called the critical value at the 5% significance level 
for a sample of size 12. We can use it to test a hypothesis that any random 
sample of size 12 comes from a population with some assumed median M. 
We simply count the number of values above and below M and check 
whether the smaller number of values is 2 or fewer; if it is, we reject the 
hypothesis. 


This procedure does not apply only to random samples of size 12; for other 
sample sizes, we proceed in just the same way, though the critical value is 
different. In the next activity, you will find the critical value for a sample 
of size 15. 


Activity 29 Sample of 15 schools + 
[x] 


B 


Suppose that we want to test the hypothesis that the population median is 
equal to 0.98%, and that we have a random sample of size 15 available. 
The probability distribution for this situation is shown in Figure 9. 
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Figure 9 Probability distribution for a random sample of size 15 
(a) Calculate the following probabilities: 
P(O[+]) + POH) 

















e = P(O[+]) + PAH) + PL) 
© = P(O[+]) + PAH) + P2H) + Plt) 
e = P(O[+]) + PIF) + PE] + Plt) + PE) 





(b) Which of your answers in part (a) is the largest value below 0.025? 


(c) Write down the 5% most extreme outcomes, and state the critical 
value at the 5% significance level. 


(d) Would you reject the hypothesis at the 5% level if the sample 
contained 14 values below the assumed median and 1 above it? 
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(e) Would you reject the hypothesis at the 5% level if the sample 
contained 5 values below the assumed median and 10 above it? 


(£) Would you reject the hypothesis at the 5% level if the sample 
contained 12 values below the assumed median and 3 above it? 


In Activity 29 you found that the critical value at the 5% significance level 
for a random sample of size 15 is equal to 3. In the same way, you could 
calculate the critical value for any size of sample, though you are not 
expected to do that in this module. 


A typical probability distribution is shown in Figure 10. The shaded 
region corresponds to the outcomes 


oH, 1[+],...,Cl4] 


and 


where C is the critical value at the 5% significance level. The shaded area 
is called the critical region at the 5% significance level. 


Probability + 











O[—] Outcomes 





Critical region 


Figure 10 Critical region 


Critical region and critical value 


The critical region at the 5% level is chosen so that the combined 
probabilities of the outcomes falling in the region is 0.05 or just less 
than that. 


The critical value at the 5% significance level can be used to 
determine whether or not an outcome is in the critical region. 


4 Testing hypotheses | 


For a sample of size 15, the critical value is C = 3, and the critical region Remember that for a sample of 
contains the outcomes 0[+], 1[+], 2[+], 3[+], 0[—], 1[-], 2/—] and 3[-]. size 15, 2[+] and 13[—] describe 
From Activity 29, it follows that the combined probability of these the same outcome. 

outcomes is 2 x 0.017 = 0.034, which is less than 0.05. 


The procedure can be used to calculate critical values and critical regions 
for samples of any size. Figure 10 illustrates the critical region for a 
moderately large sample. 


The critical values at the 5% significance level have been calculated for 
random samples of sizes 1 to 40 and are given in Table 8. (The table could 
be extended to sample sizes greater than 40, but that is unnecessary for 
this module.) 


Table 8 Critical values at the 5% significance level 


Sample Critical value at the 5% Sample Critical value at the 5% 


size significance level size significance level 
1 — 21 5 
2 — 22 5 
3 -— 23 6 
4 -— 24 6 
5 -— 25 T 
6 0 26 7 
T 0 27 T 
8 0 28 8 
9 1 29 8 
10 | 30 9 
11 1 3l 9 
12 2 32 9 
13 2 33 10 
14 2 34 10 
15 3 35 11 
16 3 36 11 
17 4 37 12 
18 4 38 12 
19 4 39 12 
20 5 40 13 


For the sample sizes 1, 2, 3, 4 and 5, no critical value is given. To see why 
this is the case, consider a sample size of 4, for example. We found the 
appropriate probability distribution in Activity 25 (Subsection 3.2). The 
probability of 0[+] is 7; = 0.0625, the same as the probability of 4[+]. So 
the probability of the most extreme outcomes, 0[+] or 4[+], is t = 0.125, 
which is greater than 0.05. So no sample is extreme enough for us to be 
able to reject the hypothesis at the 5% significance level. The same 
situation applies for sample sizes 1, 2, 3 and 5. 
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Activity 30 Critical values 


Use Table 8 to find the critical value at the 5% level for the following 
sample sizes. For each sample, specify the outcomes for which the 
hypothesis would be rejected at the 5% significance level. 


(a) Sample size 21 (b) Sample size 32 (c) Sample size 7 


So we can easily decide whether a given sample outcome should lead us to 
reject a hypothesis at the 5% significance level. We simply count up the 
number of [+]s and the number of [—]s in the sample. If the smaller of 
these two numbers is less than or equal to the critical value, then we reject 
the hypothesis. 


4.2 The sign test 


In Subsection 4.1, we introduced the hypothesis test known as the sign 
test. You will meet other hypothesis tests later in this module. In this 
subsection we describe the procedure for the sign test. 


All the information we require from the sample is the smaller of the 
numbers of [+]s and [—]s it contains. This number is called the test 
statistic. If the test statistic is less than or equal to the critical value, 
then we reject the hypothesis at the 5% significance level. 


Suppose we want to test whether some population has a median value of 
M. Then we select a random sample and apply the following procedure. 


Procedure: the sign test 

1. State the hypothesis that the population median is M. 

2. Count the number of values in the sample that are larger than M 
(denoted by [+]s) and the number of values that are smaller than 
M (denoted by [—]s). The smaller of these two values is the test 
statistic. 

3. Use Table 8 (at the end of Subsection 4.1) to write down the 
critical value at the 5% significance level corresponding to the 
size of the sample. 

4. Compare the test statistic with the critical value. If it is less than 
or equal to the critical value, then the hypothesis is rejected at 
the 5% significance level. If the test statistic is greater than the 
critical value, then the hypothesis is not rejected. 


These steps can be summarised by the flow chart given in Figure 11. 








Set up 
HYPOTHESIS 







Find value of 
TEST STATISTIC 







Look up 
CRITICAL VALUE 








Is test statistic 
< critical value? 


Do not reject 
hypothesis 


Figure 11 Steps in the sign test 


To illustrate how to apply the sign test, let us complete the example we 
started in Section 1 and continued at the beginning of this section. We 
wanted to find out whether the truancy rate in large secondary schools in 
the East of England (those with over 1000 pupils) was the same as the 
overall rate for all secondary schools in the region. The truancy rate was 
recorded for a random sample of 12 large schools. We want to test whether 
this sample comes from a population whose median is 0.98%, the overall 
truancy rate for the East of England. 


So we set up the hypothesis that the population median rate for large 
schools in the East of England is indeed 0.98%. 


In the sample, we count the number of values above and below the 
assumed median of 0.98%. This gives 9 [+]s and 3 [—]s. The smaller of 
9 and 3 is 3, so the test statistic is 3. 


From Table 8, the critical value at the 5% significance level for a random 
sample of size 12 is equal to 2. 


4 Testing hypotheses | 
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We then compare the test statistic with the critical value. 


Since 3 is greater than 2, we cannot reject the hypothesis at the 5% 
significance level. This means that the sample we have observed could 
quite possibly have been drawn from a population with a median of 0.98%; 
there is insufficient evidence to suggest otherwise. 


So hypothesis testing provides a method of inferring back from the sample 
to the population. However, it only enables us to reach a statistical 
conclusion, and there is more to interpreting results than this. We shall 
discuss this in Subsection 5.3, but for the moment you should concentrate 
on the statistical technique. 


Activity 31 Sample of 23 small schools 


In the example, we looked at data from large schools with over 1000 
pupils. The data below show the truancy rates from a random sample of 
23 small (secondary) schools (those with fewer than 500 pupils) in the East 
of England. You are asked to investigate whether the median percentage 
truancy rate for small schools is the same as the median percentage 
truancy rate for all schools in the East of England. 


0.70 0.73 0.16 1.76 0.95 0.80 1.48 0.96 0.64 2.80 0.52 0.96 
0.36 0.10 1.21 0.04 0.83 0.64 0.71 0.16 0.71 0.75 0.71 


(a) Write down the hypothesis to be tested. 


(b) Record the number of values lying above and the number lying below 
the assumed median. Hence write down the test statistic. 


(c) What is the appropriate critical value at the 5% significance level? 


(d) Decide whether you would reject the hypothesis at the 5% significance 
level. 


You have now covered the material related to Screencast 5 for 
Unit 6 (see the M140 website). 


So far, we have applied the sign test to questions about truancy rates. It 
can be used in very many fields of application. One such application is 
shown next. 


Activity 32 Comparing two corn hybrids 


An experiment was conducted to compare two different hybrid lines of 
corn, Hybrid A and Hybrid B, with 28 plots used. Conditions were 
uniform within a plot, but differed between plots. To treat the hybrids 
comparably, each plot was divided in two: Hybrid A was grown on one half 
and Hybrid B on the other. The differences in yields from the two hybrids 
(Hybrid A — Hybrid B) are given in Table 9. 


Examine whether one hybrid is better than the other by testing the 
hypothesis that the median difference between the two hybrids is 0. 
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Table 9 Differences in yield between two corn hybrids 


17 =—Ls> —06 =5.6 =1:3 =19 =10.3 =21 -—06 06 
—0.7 -05 01 07 O9 -11 —0.5 —1.0 0.2 
—0.9 —0.2 —0.4 0.2 —0.7 —1.4 -04 —0.9 —1.5 


(Data source: Dixon, W. J. and Mood, A. M. (1946) ‘The statistical sign test’, 
Journal of the American Statistical Association, vol. 41, pp. 557-566) 


Exercises on Section 4 





Exercise 6 Testing a claim about petrol consumption 


In Unit 1 (Section 3) we introduced values of petrol consumption for a 
Honda Civic 1.4i. The stemplot of these data is reproduced in Figure 12. 





Figure 12 Stemplot of the petrol consumption data 


Suppose that a dealer claimed that this type of car would give 37 miles to 
the gallon. Use the sign test to examine the truth of this claim. 
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Exercise 7 Do mice like mirrors? 


An experiment was conducted to examine whether mice liked to have a 
mirror in their cage. There were 15 pairs of cages used. The cages in a pair 
were linked, and one cage in each pair had a mirror in it. A different 
mouse was placed in each pair, and the time that it spent in each cage was 
recorded. Three of the 15 mice spent more time in the cage with the 
mirror, while the other 12 mice spent more time in the cage without the 
mirror. 


Test the hypothesis that the presence/absence of a mirror does not 
influence where a mouse spends its time. 


5 Testing hypotheses II 


In Section 4, you were introduced to hypothesis testing and the sign test in 
particular. In this section, you will learn more about hypothesis testing. 
We start with an approach that does not reduce the conclusions down to 
just ‘reject’ or ‘do not reject’. 


5.1 Significance probabilities: p-values 


From Table 8 (Subsection 4.1), the critical value at the 5% significance 
level for a sample of size 20 is 5. Suppose, though, that in a random 
sample of 20 the number of values above the median (the number of [+] 
values) is only 3. Then, not only do we reject the hypothesised median at 
the 5% significance level, but we reject it very comfortably, as 3 is almost 
as close to 0 as it is to 5. Figure 13 gives the probability distribution for 
the number of values above the median in a random sample of size 20. 
From the figure, the probability of three or fewer [+] values is only 

0.000 + 0.000 + 0.000 + 0.001 = 0.001. Similarly the probability of three or 
fewer [—] values is 0.001. Hence, if we used a 0.2% significance level (rather 
than a 5% significance level), then the critical region would contain the 
outcomes 0[+], 1+], 2{+], 3[+], 0[—], 1[—], 2[-] and 3[—], as the combined 
probability of these outcomes is 2 x 0.001 = 0.002. Stating that we reject a 
hypothesis at the 0.2% significance level is a much stronger statement than 
saying we reject it at the 5% significance level. Consequently, we would 
often like to be more precise when reporting the result of a hypothesis test, 
rather than simply saying how it compares with the 5% significance level. 


5 Testing hypotheses II 
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Figure 13 Probability distribution for a random sample of size 20 


When we observe three [+] values in a sample of 20, we refer to 0.002 as 
the significance probability or, more commonly, as the p-value of the 
hypothesis test. If we set our significance level at anything larger than 
0.2% (for example, we might set it to 5%), then we would reject the 
hypothesis, while if we set our significance level to anything smaller than 
0.2%, then we would fail to reject the hypothesis. 


Procedure: Obtaining p-values 


To obtain the p-value (significance probability) for a hypothesis test, 
work through the following steps. 


1. Assume the hypothesis is true. 
2. Consider all the possible outcomes and divide these into two sets: 


Set A contains those outcomes that are as extreme or more 


extreme than the outcome that actually occurred. Set A is the smallest critical 
‘ 7 region that contains the 
Set B contains those outcomes that are more likely than the outcome that actually occurred. 


outcome that actually occurred. 


3. Calculate the probability that a random outcome would be from 
Set A. This probability is the p-value. 


When data are analysed on a computer using a standard statistical 
package, the output almost always reports the result of a hypothesis test 
as a p-value. 


A small p-value indicates that one of the more unlikely outcomes has 
occurred or the hypothesis that led to the p-value is false. As the p-value 
decreases, such an outcome becomes less likely and the evidence against 
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the hypothesis increases. Table 10 gives a reasonable way of interpreting 
different p-values. Notice that we never conclude that the hypothesis is 
true — a large p-value only means that there is little to suggest that the 
hypothesis is false. 


Table 10 Interpretation of p-values 


p-value Rough interpretation 


p > 0.10 Little evidence against the hypothesis 

0.10 > p > 0.05 Weak evidence against the hypothesis 

0.05 > p > 0.01 Moderate evidence against the hypothesis 
0.01 > p > 0.001 Strong evidence against the hypothesis 
0.001 > p Very strong evidence against the hypothesis 


At the end of a hypothesis test it is helpful to give a verbal description of 
the result and state the associated p-value in order to add precision. For 
example, we have focused on the hypothesis that the median truancy rate 
for large schools in the East of England is 0.98%. Using Figure 6, the 
p-value for the test when there are three [—] values out of 12 is 


P(O[-]) + PUȚ- + P@I-}) + Pl) 
+P(0[+]) + POH) + PH) + POL) 

= 2 x [P(0[+]) + POH) + PEH) + PH) 

= 2 x (0.000 + 0.003 + 0.016 + 0.054) = 0.146. 


We might say, ‘The p-value is 0.146, so there is little evidence against the 
hypothesis that the median truancy rate in large schools in the East of 
England is 0.98%.’ Similarly, if there were only two [—] values out of 12, 
then the p-value would be 2 x (0.000 + 0.003 + 0.016) = 0.038, and we 
might say, ‘The p-value is 0.038, so there is moderate evidence that the 
median truancy rate in large schools in the East of England is not 0.98%. 
The data suggest that the truancy rate is higher than that.’ 


You have now covered the material related to Screencast 6 for 
Unit 6 (see the M140 website). 


Activity 33 Essex schools: testing with p-values 


Essex is a county in the East of England region. To compare their truancy 
rates with the regional median of 0.98%, a random sample of 15 
(secondary) schools in Essex was selected and their truancy rates recorded. 
These rates were as follows: 


2.04 1.86 1.64 1.84 0.32 0.62 1.57 1.18 
0.95 1.50 0.62 1.46 1.27 0.63 1.44 


(a) What is the hypothesis to be tested? 


(b) Record the number of values lying above the median, and the number 
lying below it. Hence write down the test statistic. 


5 Testing hypotheses II 


(c) Using Figure 9 (Subsection 4.1), calculate the p-value given by the 
hypothesis test. 


(d) Making reference to this p-value, write down the conclusion to be 
drawn from the hypothesis test. 


Activity 34 Truancy rates in academies 


Suppose we took a random sample of secondary school academies in the = 
East of England and examined their truancy rates. For each of the 
following sample results, test whether the median truancy rate for 


academies could be 0.98%. Use Figures 6 and 9 
(in Subsection 4.1) for the 


(a) The sample size is 12, with 11 values above 0.98% and 1 value below it. probabilities ou need, 
(b) The sample size is 15, with 3 values above 0.98% and 12 values 


below it. 


5.2 The sign test with ties 


In the examples considered so far, there has never been a case in which 

one, or more, of the sample values is actually equal to the assumed 

population median. When this does happen, we shall refer to the situation 

as a tie. This problem is dealt with quite easily: we discard the tied values 

and reduce the size of the sample accordingly. The procedure is Some statisticians deal with ties 


demonstrated in Example 8. in other ways. In this module we 
use the procedure described. 





Example 8 Testing with a tie 


In this unit, truancy rates have been re-calculated from government figures 
so as to determine them with an accuracy of two decimal places. In the 
government publication, the truancy rates are only given to an accuracy of 
one decimal place. Using the published rates, the median truancy rate for 
secondary schools in the East of England would be 1.0% (rather than 
0.98%). 


The data for the 15 schools in Essex (Activity 33) become: 


2.00 1.9 16 1.8 03 06 1.6 1.1 
1.0 15 06 1.5 13 06 1.4 


Suppose we want to use these data to test whether the median truancy 
rate in Essex differs from a population whose median is 1.0%. There are 10 
values above and four values below 1.0%, and one value actually equals 
1.0%. We could write this as 


ten [+]s, four [—]s and one [=]. 
We discard the [=] and treat the sample as one of size 14 with ten [+]s and 
four [—]s. So the test statistic is 4. From Table 8 (Subsection 4.1), the 
critical value is 2. Since 4 is greater than 2, we cannot reject the 
hypothesis, and we conclude that it is quite possible that the median 
truancy rate in Essex is 1.0%. 
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In general, the procedure for dealing with ties is as follows. 


Procedure: the sign test with ties 


For a sample of size n containing m ties (that is, m of the sample 
values are equal to the assumed median), discard the m ties and treat 
the sample as one of size (n — m). 





Mark one answer 


You are going shopping and need to buy 
neckties for all your male relatives. 
Which way should you go? 


First left | ] 
Second left | ] 
Straight on |_| 


Do a U-turn O 








Another sign test with ties! 


Activity 35 Small schools with some tied data 


When truancy rates are only recorded to one decimal place, the following 
are the rates for 23 small schools in the East of England. (The data are 
from Activity 31.) 


0.7 0.7 0.2 1.8 1.0 0.8 1.5 1.0 06 2.8 0.5 1.0 
0.4 0.1 1.2 00 0.8 0.6 0.7 0.2 0.7 0.8 0.7 


We want to test the hypothesis that the truancy rate of small schools in 
the East of England is 1.0%. 


(a) How many values are above the hypothesised median, how many are 
equal to it, and how many are below it? 


(b) What is the test statistic, and what is the sample size that should be 
used in the hypothesis test? 


(c) Test the hypothesis and state your conclusion. 


5 Testing hypotheses II 


Activity 36 Change in Sheffield temperature? 


Meteorological records are available that give the average daily maximum 
temperature in Sheffield in August for each year from 1883, apart from 
three years (1918, 1919 and 1923). The median of these temperatures from 
1883 to 2011 is 19.7°C. The following stemplot gives the average daily 
maximum August temperatures in Sheffield in the 30 years from 1982 to 
2011. If the climate has not changed since 1883, then these temperatures 
should be a random sample from all the average daily maximum annual 
temperatures in 1883-2011. That is, the median of these temperatures 
should be 19.7°C. 
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Figure 14 Stemplot of the average daily maximum August temperature 
in Sheffield (1982-2011) 


(Data source: Met Office, historic station data) 


We want to test the hypothesis that the values in the stemplot are a 
random sample from a population whose median is 19.7°C. 


(a) How many values are above the hypothesised median, how many are 
equal to it and how many are below it? 


(b) What is the test statistic, and what is the sample size that should be 
used in the hypothesis test? 


(c) Test the hypothesis and state your conclusion. 
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5.3 Conclusions and reservations 


In Sections 4 and 5, the focus has been on truancy rates in different types 
of (secondary) schools in the East of England, where the rate is 0.98%. 
We found evidence that small schools in the region had a lower median 
truancy rate than 0.98%, while there was little evidence that large schools 
or schools in Essex had a different rate. However, in drawing a conclusion 
from a hypothesis test, or making any other form of statistical inference, 
we should think a bit about whether we have any reservations about the 
analysis. Let us do that now. 


First, consider the data we used. Truancy rates for schools were probably 
obtained from class attendance registers. Presence or absence of each child 
is noted at the beginning of each morning and afternoon session. However, 
it is well known that some children attend for the register and then 
disappear either for the whole session or perhaps just for one particular 
lesson they dislike. 


Also, some schools are probably more disciplined than others in collecting 
and recording the data. Indeed, because the data are published, it is 
conceivable that some schools might feel that a low truancy rate would 
attract new pupils. Other criticisms could also be raised, so we must have 
some reservations about the accuracy of our data. However, they are from 
the best publicly available source of information on truancy rates. 


Second, let us think about the type of conclusion we drew. We decided 
that there was no reason to assume that the median truancy rate for large 
schools in the East of England was different from 0.98%, which was the 
overall rate for all schools in the East of England. Notice that we do not 
say that the rate for large schools is equal to 0.98%. The fact that we do 
not reject the hypothesis certainly does not justify us assuming that the 
median is equal to 0.98%. If we examined all the large schools in the East 
of England, we would almost certainly find that the median truancy rate 
for large schools differs in the East of England and is not exactly 0.98%. 
On the basis of the sample we have, though, it is unclear whether the 
median rate would turn out to be above or below 0.98%. 


Note that a statistical test can tell us nothing about causation. For 
example, we concluded in Activity 31 that the truancy rate for small 
schools in the East of England is probably lower than the overall rate for 
all schools in the East of England. We cannot conclude from this that the 
small size of a school causes the truancy rate to fall. There may well be 
other factors, like the situation of the school, which influence both the size 
and the truancy rate. 


The reservations expressed in this subsection may have left you feeling that 
it was not worth doing the statistical analyses. However, the analyses have 
added to our knowledge about truancy rates. The point is that a statistical 
analysis enables you to quantify the uncertainty that exists in drawing 
conclusions — in most other circumstances, the uncertainty of conclusions is 
either not recognised or is ignored. 








‘Thanks Mary, that does put our data tw perspective.’ 


Exercises on Section 5 


Exercise 8 A second look at mice and mirrors 


In Exercise 7 (Section 4) an experiment was described in which 12 out of 
15 mice preferred a cage without a mirror, while three mice preferred a 
cage with a mirror. The exercise asked for a test of the hypothesis that the 
presence/absence of a mirror does not influence where a mouse spends its 
time. Using the probabilities given in Figure 9 (Subsection 4.1), determine 
the p-value of the test and evaluate the evidence against the hypothesis. 





Exercise 9 Change in depression 


A psychologist rated 16 subjects undergoing withdrawal from narcotics on 
the basis of the extent of their depression before and one hour after 
receiving a dose of methadone. The degree of depression was rated on a 
scale from 1 (= no depression) to 5 (= severe depression). The results are 
given in Table 11. 


Table 11 Depression scores before and after a dose of methadone, and 


their differences 
Subject 1 2 8 9 10 11 12 13 14 #15 


3 

Before 2 1 2 3 2 
After 1 1 3 
1 


eH 
me 
ons le >) 


Change —1 0 





16 
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If methadone does not affect depression, then the difference between a 
subject’s ‘before’ and ‘after’ scores is equally like to be above or below 0. 
(It could also equal 0.) Thus, we want to test the hypothesis that the 
median difference between ‘before’ and ‘after’ scores is 0 — rejecting this 
hypothesis would show evidence that methadone affects depression. 


(a) How many values are above the hypothesised median, how many are 
equal to it, and how many are below it? 


(b) What is the test statistic, and what is the sample size that should be 
used in the hypothesis test? 


(c) Test the hypothesis and state your conclusion. 





6 Computer work: probabilities and 
the sign test 


In this section you will use the statistical package Minitab to calculate 
probabilities of the type that you calculated (for small sample sizes) in 
Subsection 3.2. You will also learn how to use Minitab to perform a sign 
test for a population median. You should now turn to the Computer Book 
and work through Chapter 6. 


Summary 


In this unit, you have been introduced to the basic concepts of probability 
and hypothesis testing. These are incredibly important in statistics: 
probability underpins most statistical methods, and one of the most 
common tasks in statistical inference is hypothesis testing. A consequence 
is that the ideas introduced in this unit will be reinforced in subsequent 
units and, indeed, in any further statistics modules that you study. 


You have learned the fundamentals of probability — its definition as a 
proportion, the addition rule and the multiplication rule. You have also 
learned how to count the number of ways that combinations can occur, so 
that you can now calculate the probability distribution for the number of 
sample data that will exceed the population median. You used this 
probability distribution to perform a sign test — while simultaneously 
learning the structure of a hypothesis test, the meaning of a p-value, and 
how to report your conclusions and consider reservations you might have 
about the test. The range of tasks you can perform with Minitab has been 
extended. 


Learning outcomes 


Learning outcomes 


After working through this unit, you should be able to: 


appreciate how the modelling diagram from earlier units can be 
modified to take account of the important step of inferring back from 
the sample to the population 


appreciate that a general question needs to be clarified and made more 
precise before it can be answered using statistics 


calculate probabilities based on random selection 

apply the addition rule for mutually exclusive events 

apply the probability rule for complementary events 

apply the multiplication rule for statistically independent events 
count the number of ways that a specified combination can occur 


calculate the probability distribution for the number of values above 
the median of the population 


understand the concepts of a hypothesis test and of rejecting a 
hypothesis at the 5% significance level 


look up the critical value for the sign test at the 5% significance level, 
and use this value to determine the critical region 


apply the sign test, including the case when ties are present 


consider what reservations you have about the conclusion drawn from 
a hypothesis test 


use Minitab to calculate the probability distribution for the number of 
outcomes that exceed the median and perform a sign test. 
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Solutions to activities 


Solution to Activity 1 


There are many possible factors you could have thought of. Some are 
given here, but your list may be quite different and equally valid. 


Work: children may absent themselves from a particular lesson because 
they dislike that subject, have not completed homework, etc. 


Home background: parents who take their children to school and are 
interested in their education will encourage regular attendance. 


Bullying: children may stay away because they are being bullied and are 
frightened to go to school. 


Helping at home: children may be encouraged by their parents to miss 
school to help with housework or take care of young siblings, or they may 
feel they need to look after their parents. 


Age: truancy is much commoner among older children approaching 
school-leaving age than among those in primary schools. 
Solution to Activity 2 


As with the previous activity, there are many possible answers here. Some 
are listed below. 


Level: primary or secondary school 
Location: inner city or rural school, for example 
Size of school 


Type of school: independent, academy, etc. 


Solution to Activity 3 


There are many possible reasons that are definitely truancy: missing 
school to go to the cinema/football match/fishing; missing school to avoid 
boredom /a lesson you hate; etc. 


Reasons that would not normally be classed as truancy include illness, 
medical/dental appointments, attendance at family funerals, being 
kidnapped, religious observance. 


Reasons that might (or might not) be truancy include staying off school 
because you didn’t feel that well, when it would depend on how unwell you 
felt and how often it happened. Another example might be staying off 
school to look after a sick brother/sister/parent or go on holiday with 
parents, when it might again depend on how often this happened, and 
might also depend on whether the school had given permission. 


Solution to Activity 4 


In order to compare truancy in different schools, we must somehow take 
account of the number of pupils in each school. If we simply counted the 
number of cases of truancy, then large schools would almost inevitably 
have higher truancy counts. So we need some measure of the rate of 
truancy in a school. Here are three possible ways of collecting data and 
measuring truancy; you may well have thought of others. 


1. 


Use the school attendance registers which record the children present 
at the beginning of each morning and afternoon. Record which of the 
children who were not present had notes of authorised absence due to 
illness or other reason and discount these cases. Calculate the 
proportion of children absent without authorisation, averaged over a 
year. 


Choose a particular week. Count the number of children present for 
each lesson during that week, and divide this figure by the number of 
lessons and the number of children in the school. Subtract this 
number from 1 to give a truancy rate. 


At some arbitrary time, send someone round the school to count the 
number of children present. Subtract this number from the total 
number on the school register to get the number who are absent. 
Divide by the total number of children to give a truancy rate. 


None of these ways is ideal, but you can see that it is not a simple matter 
to obtain a reliable figure; there are many factors to consider. 


Solution to Activity 5 
(a) It would be possible to find the median of the 12 values by drawing a 


we 


stemplot, but with so few, it is probably easier just to list them in 
order: 


0.31, 0.82, 0.83, 1.06, 1.09, 1.19, 1.44, 1.52, 1.84, 1.88, 2.78, 2.90. 
Since the sample size is even, 


sum of two middle data values 
2 


median = 


_ 1194144 


2 
= 1.315 ~ 1.32. 


So the median of the sample is 1.32%. 


The median value of the truancy rate of the sample of 12 large East of 
England schools is somewhat larger than the median truancy rate for 
all the East of England schools. However, a different sample would 
probably have had a different median. In summary, the median 
truancy rate of all large schools in the East of England looks as if it 
might be larger than 0.98%, but we cannot be sure. 


Solutions to activities 


See Subsection 4.2 of Unit 1. 
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Solution to Activity 6 


This is a suggested ranking of the events, from the most likely to the least 
likely. 


C. The sun will rise tomorrow. 

. The sun will shine tomorrow. 

. You toss a coin and it lands tails up. 

. You throw a die and it shows a six. 

. Two out of a group of ten people have the same birthday. 
. Your new colleague at work has the same birthday as you. 


. You will win the jackpot in the National Lottery next week. 


Ger WUQJ 


. A member of a hockey team is 150 years old. 


Your ranking may be slightly different from this, but it should be fairly 
similar. 


(Note that event C can be regarded as certain to happen, whereas event H 
is impossible.) 

Solution to Activity 7 

(a) The probability of selecting a male student is 


number of male students 3293 
total number of students 6082 
(b) The probability of selecting a law student is 


~ 0.541. 


number of law students 334 
total number of students 6082 
(c) The probability of selecting a female medical student is 


~ 0.055. 


number of female medical students 206 


= — ~ 0.034. 
total number of students 6082 


Solution to Activity 8 
(a) The probability of selecting a woman is 


number of women in population 
total number in population 
If we apply this definition to the case where there are 5 men and 
5 women, then the probability of selecting a woman is 
5 
— = 0.5. 
10 
(b) For a population consisting of 1 man and 9 women, using the definition 
in the solution to (a) gives the probability of selecting a woman as 
9 


— = 0.9. 
10 


(c) For 9 men and 1 woman, the probability of selecting a woman is 


b= 0.1 
100 
(d) For 10 men and 0 women, the probability of selecting a woman is 
0 
—=0. 
10 
(e) For 99 men and 1 woman, the probability of selecting a woman is 
1 
— =0.01. 
100 
(£) For 0 men and 100 women, the probability of selecting a woman is 
1 
Mor 
100 


Solution to Activity 9 
(a) For each of the events, the probabilities are as follows: 


P(child absent through truancy for 5 to 9 days) 
total number absent for 5 to 9 days 
total number of children 


90 
300 0.3. 
P(child absent for 5 to 9 days and attends School B) 
30 
== =0.1. 
300 i 


P(child from School A is absent for 10 to 19 days) 
number absent for 10 to 19 days from School A 
total number at School A 


26 
= — = 0.13. 
200 


P(child absent for > 10 days) 
number absent for 10 to 19 days + number absent for > 20 days 
total number of children 


45+1 
a W 0.2. 
300 300 


(b) Let C be the event that a child is absent through truancy for 5 to 
9 days and let B be the event that a child attends School B. (You 
might well have chosen different letters: that does not matter.) 


Then 
P(child absent through truancy for 5 to 9 days) = P(C) 


and 


P(child absent for 5 to 9 days and attends School B) = P(C and B). 
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Solution to Activity 10 


(a) 


The events are mutually exclusive, as a person can only have one 
blood type. 


The events are not mutually exclusive. Even though it is not very 
common, a person can have black hair and blue eyes. 


These are not mutually exclusive events. Although all three events 
cannot simultaneously happen, two of them could happen at the same 
time. (In fact, any two of them could happen at the same time.) 


These are not mutually exclusive events. While (i) and (ii) cannot 
occur simultaneously, either of them could occur with (iii). 


Solution to Activity 11 


(a) 


Altogether there are 200 children at School A. Using the data in 
Table 3, the probabilities can be calculated as follows. 


There are 108 children absent through truancy for 0 to 4 days. Hence 


1 
P(child from School A absent for 0 to 4 days) = ae = 0.54. 


There are 60 children absent through truancy for 5 to 9 days. Hence 


60 
P(child from School A absent for 5 to 9 days) = can 0.3. 


Altogether 108 + 60 = 168 children are absent through truancy for 
between 0 and 9 days. Hence 


168 
P(child from School A absent for 0 to 9 days) = co 0.84. 


Let E denote the event that a child at School A is absent through 
truancy for 0 to 4 days, and F denote the event that the child is 
absent for 5 to 9 days. The events FE and F are mutually exclusive, as 
both cannot apply to any one child. 


From the solution to part (a), P(E) = 0.54 and P(F) = 0.3. The 
addition rule for mutually exclusive events states that 
P(E or F) = P(E) + P(F). 


Now P(E) + P(F) = 0.54 + 0.3 = 0.84, which is the value we found in 
part (a) for the probability that a child at School A is absent through 
truancy for 0 to 9 days, that is P(E or F), so the addition rule holds 
in this case. 


Solution to Activity 12 


(a) Let M stand for the event that a man is selected and W stand for the 
event that a woman is selected. 


2 
P= 8 en: 
6082 
Hence, P(W) = 1 — P(M) ~ 1 — 0.541 = 0.459. 
575 
(b) P(selecting a medical student) ~ 6082 > 0.095. 


Hence 
P(selected student is not a medical student) ~ 1 — 0.095 = 0.905. 
This is quicker than the alternative of putting: 


P(selected student is not a medical student) 
number of students in science, arts and law 


number of students 


2602 + 2571 4 
= 602 + 2571 + 33 = 5007 ~ 0.905. 
6082 6082 


Solution to Activity 13 


(a) ‘G does not occur’ is the event that a child selected at random from 
the two schools is absent through truancy for 0 to 4 days or is absent 
for 10 or more days. 

(b) | P(G does not occur) = 1 — P(G) 

90 

300 


=1- = 0:7. 
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Solution to Activity 14 
(a) 


beef 

chicken 
soup fish 
pasta 


quiche 


beef 
chicken 
fish 


< 
< 
< 


salad 
quiche 
beef 


chicken 
fish 
pasta 
quiche 


beef 
chicken 
fish 
pasta 


prawns 


quiche 


Tree diagram for a two-course meal 


(b) There are four vegetarian combinations: soup—pasta, soup—quiche, 
salad—pasta and salad—quiche. 


(c) P(vegetarian meal) 


_ number of vegetarian meal combinations 4 _ 0.2 
~ total number of meal combinations = 20 7 


(d) P(vegetarian first course) 


number of vegetarian first courses 2 
Se f 0.5. 
total number of first courses 4 


P(vegetarian second course) 


number of vegetarian second courses 2 0.4 
total number of second courses 5 _ 





The product of these answers is 0.5 x 0.4 = 0.2, which is the answer in 


(c). 
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Solution to Activity 15 


(a) 


In part (a) of Activity 14, you found that there are 20 different 
two-course meal combinations. From each of these two-course 
combinations we can form three different three-course combinations, 
because there are three choices for the third course. Hence in total 
there are 20 x 3 = 60 different three-course combinations. 


P(third course is good with custard) 


number of third courses good with custard 2 ~ 0.667 


w 


total number of third courses 


Two of the first courses are vegetarian, as are two of the second 
courses, and two of the third courses are good with custard. Hence the 
number of three-course meals that have two vegetarian courses 
followed by a course that is good with custard is 2 x 2 x 2 = 8. (They 
are: soup—pasta—pie, soup—quiche—pie, salad—pasta-—pie, 
salad—quiche—pie, soup—pasta—crumble, soup—quiche—crumble, 
salad—pasta-—crumble and salad—quiche-crumble.) Hence 

P(vegetarian first course and vegetarian second course 


and third course is good with custard) 


= LA ~ 0.133. 
60 


P(vegetarian first course) x P(vegetarian second course) 
x P(third course is good with custard) 
~ 0.5 x 0.4 x 0.667 ~ 0.133. 
This is the same result as found in part (b) for 


P(vegetarian first course and vegetarian second course). 


Solution to Activity 16 


(a) 


(b) 


The events are not independent. Taller people tend to be heavier, so if 
a person is taller than average, then they are more likely to be heavier 
than average. 


These events are independent. The height of the first person has no 
influence on the weight of the second person, assuming the people 
were chosen at random. (If you choose two people standing next to 
each other, then the choices are not random, and the characteristics of 
one person could relate to the characteristics of the other.) 


These events are independent, as the day on which you are born has 
no influence on your weight. 


The events are not independent. For example, if the first two events 
both occur then we know that the third event occurs. 


Solutions to activities 


73 


Unit 6 Truancy 


74 


(e) The probability that the card is an ace is 
number of aces in pack 4 1 
number of cards in pack 52 13 
If we know that the card is a diamond, then the probability that it is 
an ace is 
number of cards that are the ace of diamonds 1 





number of diamonds 13 
These probabilities are the same. That is, knowing that one of the 
events occurs does not change the probability that the other event 
occurs. Hence the events are independent. 


Solution to Activity 17 


(a) Using A, B and C to represent Ashia, Brenda and Clare, the different 
possible samples are shown in the following tree diagram. So, the 
possible samples of size 2 are (A, A), (A, B), (A, C), (B, A), (B,B), 
(B,C), (C, A), (C, B) and (C, C). Altogether there are 9 possible 
samples. 


i 


v= ow > ei = 





w 


AA 


(b) Because there are 3 possibilities for the first member of the sample 
and 3 possibilities for the second member, there are 3 x 3 = 9 possible 
samples altogether. 


Solution to Activity 18 


We have that P(M) = 7/10 = 0.7, so 
P(2M) = P(M) x P(M) = 0.7 x 0.7 = 0.49. 


Solution to Activity 19 


At any selection we can write down the probabilities of selecting a man or 
a woman: 
40 60 


P(M) =; 704 and P(W)= p= 0.6. 


For a sample of size 2, 


P(2M) = 0.4 x 0.4 = 0.16, 
P(0M) = P(2W) = 0.6 x 0.6 = 0.36, 
P(1M) = P(man selected first and woman selected second) 
+ P(woman selected first and man selected second) 
= 0.4 x 0.6 + 0.6 x 0.4 
= 0.24 + 0.24 = 0.48. 


As a check, 
P(2M) + P(OM) + P(1M) = 0.16 + 0.36 + 0.48 = 1.00. 


Solution to Activity 20 


We are now considering only samples where the same person cannot be 
considered twice. There are three women, so the first one can be selected 
in 3 ways. Then only two women remain, so the second can be selected in 
2 ways. Hence the total number of samples of size 2 consisting of two 
women is 3 x 2 = 6. 


As a check, the samples of two women are (A,B), (A, C), (B, A), (B,C), 
(C, A), (C, B), where, for example, (A, B) means that Ashia is selected 
first and Brenda is selected second. 


Solution to Activity 21 


If they are picked in the order chairperson, secretary, treasurer, 
vice-chairperson, then there are 10 choices for chairperson, 9 choices for 
secretary, 8 choices for treasurer, and 7 choices for vice-chairperson, giving 
a total of 


10 x 9 x 8 x 7 = 5040 choices. 


Solution to Activity 22 


(a) Here the order of selection matters. The chairman can be chosen from 
12 members, the vice-chairman from the remaining 11, etc. The 
number of ways of choosing people for the 4 roles equals 
12 x 11 x 10 x 9 = 11880. 


(b) The number of ways of allocating 4 people the different roles is 
4x3x2x1=24. 


(c) The number of ways of choosing a committee of 4 people from 
12 members is 
number of choices of 4 people if order of choice matters 
number of ways of allocating 4 people to 4 roles 
= 11880 


——— = 495. 
24 
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Solution to Activity 23 
_8x7x6 336 © 


= —_—__ = 56. 
3x2x1 6 
_Tx6x5x4x3 2520 | 


5S es Se SD 
5x4x3x2x1 120 


4 
40, =-=4. 
1 


803 


7 


Solution to Activity 24 


(a) The four probabilities required are: 


P(O[+]) = P(I-], [-], [-l) = P(-]) x PIEI x AIE 
=Exdxd= (= 

PQ[+]) = P(-], [-] 11) + PU-], G1 (-)) + P+], [-) (-) 
= (4) + (2) + @)=3* G)=8 

P(2[+]) = P(-], H, HD + P, [-] +1) + PCH, (+) [-) 


and 
PBH) = PH, HH = ix ixi =h 
Adding these probabilities together gives 
POH) + PAH) + P2H) + PBH) = 4+ + +41, 


as required. 
































(b) 
Sample of size 3 
2. 
8 
Number of [+]s, x| 0 1 2 3 = 
noe 
4 
PCat) bean] = 
1J 
Dgn 
© l 2 3 
x 


Probability distribution for a random sample of size 3 
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Solution to Activity 25 
(a) Nown=4. 


4 4 
1 il 
4 @ fix? 4 1 
= — X — = — = —, 
1 2 16 4 


4x3 (1\"_ 6 _3 
Bel ay 16 8 


P(1[+]) = 





A 


Y 
— 
wow 
+ 
Saa 
II 


Sie Na Ne ne 
iN 


( 








P(4[+]) = 


Probability distribution for a random sample of size 4 


Number of [+]s,2 0 1 2 3 4 
I 1 a Lf i 
a, 6 78 1 1 


The probabilities are positive and 
1 1, 3, 1 1 1+4+6+4+1_ 
16 16 


6'4 ata 





1. 


_4x3x2 (/1\f 4 1 
~ 3x2xi1 2) 16 4 


1)" _4x3x2x1 1)" Li 
2) Ax3x2x1 2) 16 


From the table, the probability that two of the selected values lie 
above the median (and hence two lie below it) is 2. 
If all selected values lie on the same side of the population median, 
they must be either all above or all below it. Applying the addition 


rule for mutually exclusive events, we obtain 


P(4[+] or 0[+]) = P(4[+]) + P(0[+]) 
= f= (50108), 
16 16 8 


Solution to Activity 26 


P(number of [+]s equals 9) = °C x (3) 


_ 12x11x10x9x8x7x6x5x4 
9x8x7x6x5x4x3x2xIl 
= 12x11x10 
= 3x2x1 


x 0.000 244 





x 0.000 244 ~ 0.054. 
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Solution to Activity 27 


There is no definitive answer to this question. Your answer may well differ 
from that of the module team, but it should follow the same pattern. 


In the population of all schools, by definition, half will have a truancy rate 
above the median and half will have one below it. As a random sample is 
representative of the population, we might expect this property to apply 
in the sample to some extent. 


(a) If all 12 values were above the assumed median, we should be very 
surprised. It is much more likely that the hypothesised median is not 
the real median. The module team would conclude that the 
hypothesis is almost certainly wrong. 


(b) Again, this outcome seems very extreme, as only 1 value is above the 
hypothesised median. Again, the module team would conclude that 
the hypothesis is almost certainly wrong. 


(c) This result is just what we would expect if the hypothesised median is 
equal to the true median. Thus the module team would conclude that 
the hypothesis is quite possibly true. 


(d) With 7 values above and 5 below, we are only 1 value different from 
half above and half below. If we drew another sample, it could easily 
have 6 above and 6 below, or perhaps 5 above and 7 below. The 
module team would again conclude that the hypothesis is quite 
possibly true. 


(e) Here you should be doubtful. The result is not very extreme, like 
cases (a) and (b), or very close to what we would expect, like cases (c) 
and (d). One could argue for (ii) that the hypothesis is probably 
wrong, though the module team would choose (i) that the hypothesis 
is quite possibly true. Some other phrase, such as ‘the hypothesis is a 
little unlikely’ would capture the view of the module team better. 


Solution to Activity 28 


(a) Since the outcomes are mutually exclusive we can use the addition 
rule to find the required probability. It is 
PO) + PAH) + PLH) + P2) 
œ 0.000 + 0.003 + 0.003 + 0.000 
= 0.006. 





(b) The answer shows that the probability of obtaining one of these 
outcomes, when the population median is 0.98%, is extremely small. 
So if we observed one of these outcomes, it would seem sensible to 
reject the hypothesis and conclude that the population median is not 
equal to 0.98%. 


Solution to Activity 29 


(a) 


(b) 
(c) 


P(0[+]) + P(1[+]) = 0.000 + 0.000 
= 0.000, 


P(0[+]) + P(1[+]) + P(2[+]) = 0.000 + 0.003 
= 0.003, 





P(O0[+]) + P(1[4+]) + P(2[+]) + P(3[4]) = 0.003 + 0.014 
= 0.017, 




















P(O) + POH) + PEH) + P(3[+]) + P(4[+]) ~ 0.017 + 0.042 
= 0.059. 


The probability of 0, 1, 2 or 3 values above the population median is 
the largest probability below 0.025. 


The 5% most extreme outcomes are 0[+], 1[+], 2[+], 3[+] and also 
12[+], 13[+], 14[+] and 15/+]. The last four are the same as 0[-], 
1[-], 2[—] and 3[—]. We should reject the hypothesis if there are three 
or fewer values on one side of the assumed median. The critical value 
at the 5% level for a sample of size 15 is 3. 


This corresponds to 1[+] and 14|—]. The smaller value is 1. As 1 is less 
than 3, we should reject the hypothesis at the 5% significance level. 


The smaller value is 5. As 5 is greater than 3, we should not reject the 
hypothesis at the 5% significance level. 


The smaller value is 3. As this is equal to the critical value, we should 
reject the hypothesis at the 5% significance level. 


Solution to Activity 30 


(a) 


From Table 8, the critical value for a sample of size 21 is 5. So the 
hypothesis would be rejected at the 5% significance level if the 
outcome were any of 0[+], 1[+], 2[+], 3[+], 4[+], 5[+], 0[—], 1[-], 2[-], 
3[—], 4[—] or 5[-]. 

The critical value for a sample of size 32 is 9. So the hypothesis would 
be rejected at the 5% significance level if the sample contained nine or 
fewer values above the assumed median, or nine or fewer values below 
the assumed median. 


The critical value for a sample of size 7 is 0. This means that the 
hypothesis would be rejected only if the sample contained 0[+] or 
O[—]; in other words, it would be rejected only if all sample outcomes 
were on the same side of the assumed median. 


Solution to Activity 31 


(a) 
(b) 


The median truancy rate for small schools in the East of England is 
0.98%. 


Just 4 outcomes lie above the assumed median, and 19 outcomes lie 
below it. The test statistic is 4, the smaller of these numbers. 
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(c) From Table 8 (Subsection 4.1), the critical value for a sample of 
size 23 is 6. 


(d) Since 4 is less than 6, we should reject the hypothesis at the 5% 
significance level and decide that the median truancy rate for small 
schools in the East of England is probably not equal to 0.98%. We can 
take our conclusion a little further. Since there are 19 values below 
0.98% and only 4 values above it, we can conclude that the median 
truancy rate for small schools is probably less than 0.98%. 


Solution to Activity 32 


There are 28 outcomes (differences) and, if the median difference between 
the hybrids were 0 we would, on average, expect half the differences to be 
positive and half to be negative. In fact, only 7 of the differences are 
positive, while 21 of them are negative. 


The smaller of these values is 7, so 7 is the value of the test statistic. 
From Table 8 (Subsection 4.1), the critical value for a sample of size 28 
is 8. 

Hence the hypothesis is rejected at the 5% significance level. Hybrid B 
appears to give a higher yield than Hybrid A. 

Solution to Activity 33 

(a) The median truancy rate for schools in Essex is 0.98%. 


(b) There are ten outcomes above the assumed median, and five outcomes 
below it. The test statistic is 5, the smaller of these numbers. 


(c) From Figure 9, the p-value equals 
2 x (0.000 + 0.000 + 0.003 + 0.014 + 0.042 + 0.092) = 2 x 0.151 = 0.302. 


(d) The p-value is 0.302 (which is quite large), so there is little evidence 
against the hypothesis that the median truancy rate is 0.98% in 
schools in Essex. 





Solution to Activity 34 
(a) The smaller of 11 and 1 is 1, so we want 
P(O[-]) + PAJ- + POf) + PAH) 
= 2x [P(O[+]) + PAI] 
= 2 x (0.000 + 0.003) = 0.006, 
from Figure 6. 


The p-value is 0.006, which is very small, so there is strong evidence 
that the median truancy rate for academies is not 0.98%. It seems 
likely that their truancy rate is higher than that. 


(b) Now the test statistic is 3. 
2 x [P(O[+]) + PAH) + PH) + PBH] 
= 2 x (0.000 + 0.000 + 0.003 + 0.014) = 0.034, 





from Figure 9. 


The p-value is 0.034, so there is moderate evidence that the median 
truancy rate for secondary academies is not 0.98%. The sample data 
suggest that their truancy rate is lower than that. 


Solution to Activity 35 


(a) Four values are above 1.0, three values are equal to 1.0, and 16 values 
are below 1.0. 


(b) The test statistic is 4 (the smaller of 4 and 16). Originally there were 
23 observations. We discard the three observations that tied with 1.0, 
leaving a sample size for the test of 20. 


(c) From Table 8 (Subsection 4.1), the critical value at the 5% significance 
level for a sample size of 20 is 5. As 4 < 5, we reject the hypothesis at 
the 5% significance level. The data provide moderate evidence that 
the median truancy rate of small schools in the East of England is not 
1.0% — the median rate appears to be smaller than that. 


Solution to Activity 36 


(a) Nineteen values are above 19.7, three values are equal to 19.7, and 
eight values are below 19.7. 


(b) The test statistic is 8 (the smaller of 19 and 8). Originally there were 
30 observations. We discard the 3 observations that tied with 19.7, 
leaving a sample size for the test of 27. 


(c) From Table 8 (Subsection 4.1), the critical value at the 5% significance 
level for a sample size of 27 is 7. As 8 > 7, we do not reject the 
hypothesis at the 5% significance level. The data provide no clear 
evidence of climate change. (The p-value is actually 0.052, quite close 
to 5%, so there is some evidence that the stemplot data are not a 
random sample from the full set of data.) 
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Solutions to exercises 


Solution to Exercise 1 


This is a suggested ranking of the events, from the most likely to the least 
likely. 


D. Death and taxes. (This gets top spot on the basis of the quote, ‘the 
only things certain in life are death and taxes’.) 


F. You get exactly three heads when you toss a coin five times. (This has 
a 10-in-32 chance of occurring — as you will be able to calculate using the 
results in Section 3.) 


A. A husband and wife find they were born on the same day of the week. 
(This is a one-in-seven chance.) 


H. It snows in London on Christmas day. (Data from 1950-2006 suggests 
this is about a 6-in-100 chance.) 


E. A mother’s first pregnancy results in twins. (Data for the United 
States suggest this is about a 3-in-100 chance.) 


G. A chicken egg has two yolks. (The British Egg Information Service give 
this as less than 1-in-1000 chance.) 


B. England will win the next football World Cup. (It’s probably not as 
likely as getting an egg with two yolks.) 


I. You will be struck by lightning next year. (If you live in the United 
States, the probability is about 1 in 280000.) 


C. A Brazilian team will win the next football European Cup. (As Brazil 
is not in Europe, a Brazilian team cannot play in the European Cup.) 


As in Activity 6, your order may be slightly different from this, but it 
should be fairly similar. 


Solution to Exercise 2 
(a) The probability that the man has blue eyes is 


number of men with blue eyes 2811 
2 = _~ 0.413. 
total number of men 6800 


(b) The probability that the man has brown hair is 


number of men with brown hair 2632 
ee a 0387 
total number of men 6800 


(c) The probability that the man has blue eyes and brown hair is 


number of men with blue eyes and brown hair _ 807 ~ 0.119 
total number of men ~ 6800 0 


(d) The categories ‘brown hair’ and ‘black hair’ are mutually exclusive, as 
a man cannot have both. Hence the probability that the man has 


Solutions to exercises 


brown hair or black hair is 
number of men with brown hair + number of men with black hair 





total number of men 


2632 + 122 
= eee ~ 0.567. 


6800 
(e) The probability that the man does not have black hair is 


1 — probability that he does have black hair 


= number of men with black hair 
E total number of men 


=1 pee 1 — 0.180 = 0.820 
E 6800 — ais 


Solution to Exercise 3 


If A is the event that Sue goes to the hockey match on Saturday and B is 
the event that her team wins, then A and B are statistically independent 

events (assuming that whether or not Sue does go to watch the match has 
no effect on how likely it is that her team wins), so 


P(A and B) = P(A) x P(B) = 0.3 x 0.4 = 0.12. 
That is, the probability that Sue will watch her team play on Saturday 
and they will win is 0.12. 
Solution to Exercise 4 


(a) The number of ways of choosing three flags from seven flags in a 
specified order is 


7x6x5= 210. 
Hence 210 different signals can be made. 


(b) When a signal is flying there are four flags left in the box. The 
number of ways of choosing four flags from seven flags when order 
does not matter is 

7x6x5x4 840 
‘CS. Sa, 
*4x3x2x1 24 


Hence there are 35 different combinations of flag that could be left in 
the box when a signal is flying. 

Solution to Exercise 5 

The probability of obtaining a head is L. Hence 


P(three heads in five tosses of a coin) 


5 

1 5x4x3 1 1 

m |= | So Xk SS = 0 = 8, 
2 3x2x1 32 32 


(This is the same as 
P(three observations out of five are above the median).) 
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Solution to Exercise 6 


The dealer’s claim would be supported if the following were true. 
The median petrol consumption of the car is 37 miles per gallon. 


To determine the test statistic, we count the number of values above and 
below 37.0. There are 8 values above and 26 below. So the test statistic 
is 8. 

From Table 8 (Subsection 4.1), the critical value for a sample of size 34 

is 10. Since 8 is less than 10, we can reject the hypothesis at the 5% 
significance level and conclude that the median petrol consumption of the 
car is unlikely to be 37 miles per gallon. After looking at the sample, we 
can infer that the petrol consumption is probably less than 37 miles per 
gallon, and so the dealer’s claim is not supported. 


Solution to Exercise 7 


There are 15 mice, and if the presence/absence of a mirror did not 
influence cage-preference, we would, on average, expect half the mice to 
spend more time in the cage with a mirror. In fact, only 3 of them did so, 
while 12 spent more time in the cage without a mirror. 


The smaller of these values is 3, so 3 is the value of the test statistic. 
From Table 8 (Subsection 4.1), the critical value for a sample of size 15 
is 3. 

Hence the hypothesis is rejected at the 5% significance level. We can 
conclude that mice appear to prefer a cage without a mirror. 

Solution to Exercise 8 


Three mice spent more time in the cage with the mirror, and 12 spent 
more time in the cage without a mirror. 


The smaller of these values is 3, so 3 is the value of the test statistic. 
From Figure 9, the p-value equals 
2 x (0.000 + 0.000 + 0.003 + 0.014) = 0.034. 


This is moderately small — the p-value is between 0.01 and 0.05. Thus 
there is moderate evidence against the hypothesis. That is, there is 
moderate evidence that mice prefer a cage without a mirror. 


Solution to Exercise 9 


(a) Three values are above 0, four values are equal to 0, and nine values 
are below 0. 


(b) The test statistic is 3 (the smaller of 3 and 9). Originally there were 
16 observations. We discard the four observations that tied with 0, 
leaving a sample size for the test of 12. 


(c) From Table 8 (Subsection 4.1), the critical value at the 5% significance 
level for a sample size of 12 is 2. As 3 > 2, we do not reject the 
hypothesis at the 5% significance level. The data provide no clear 
evidence of methadone changing depression status. 
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Factors affecting reading 


Introduction 


Introduction 


In Unit 6, we met some statistical techniques which enabled us to compare 
the truancy rate in large secondary schools in the East of England with the 
general truancy rate experienced by all secondary schools in the East of 
England. The method employed was to analyse sample data and use the 
results of the analysis to make inferences about the population from which 
the sample was drawn. In particular, we saw how the sign test enabled us 
to decide whether or not to reject, at the 5% significance level, the 
hypothesis that the population median takes a particular value. 


Medians are just one measure of location. In this unit we return to 
hypotheses about location, and you will meet another hypothesis test, the 
‘z-test’, which concerns means, rather than medians. For much of the unit 
we will be dealing with situations where we have a sample from a single 
population, as in Unit 6. We will then develop the ideas of hypothesis 
testing so as to compare two populations in terms of their locations. This 
involves setting up a hypothesis about the locations of the two populations 
(means, here, rather than medians) — the most common hypothesis is that 
the locations of the two populations are equal. A random sample of data is 
taken from each population, and these data are analysed to see whether or 
not to reject the hypothesis. Such tests are called ‘two-sample tests’, in 
contrast to ‘one-sample tests’ in the case of one population. 


The emphasis will be on the development of statistical techniques, and, as 
in Unit 6, we shall explore many of the ideas in the context of a question 
taken from the general area of education. This time we shall be looking at 
the achievement of 7- and 8-year-old children in reading: 


What factors affect a child’s reading ability? 


Section 1 starts with a brief discussion of this question. We shall then look 
at an available source of data and identify what aspects of the general 
question we can consider. 


The next step will be to define specific questions of interest and use them 

to set up appropriate hypotheses. We will then begin to develop an 

appropriate sample statistic — a test statistic — with which to perform our 

hypothesis tests, by revisiting the idea of sampling distributions in You were introduced to 

Section 2. This notion will lead us to consider a particular distribution sampling distributions in Unit 4; 
known as the ‘normal distribution’. In Section 3, we look closely at this they were used somewhat 
distribution, which is of great importance in statistics. raphy on Lb 


In Section 4, we go on to consider how the normal distribution helps us to 
define a usable test statistic, along with its sampling distribution. 

Section 5 is concerned with the application of the resulting z-test to the 
analysis of a sample of data from one population. Section 6 extends these 
ideas to investigate the difference between the means of two populations. 
One important aspect of these z-tests is that they are suitable only for 
dealing with (quite) large samples of data. 
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In Section 7, you will use Minitab to perform z-tests and learn to interpret 
the resulting p-values. Section 8 draws some conclusions about the 
educational question raised in the first section, and makes some general 
points about z-tests. 


Section 7 directs you to the Computer Book. You are also guided to the 
Computer Book at the end of Subsections 3.1 and 3.3. 


1 Clarifying the question 


In Unit 6, the modified modelling diagram was introduced. 
















1. 2. 
CLARIFY COLLECT 
question data from 





about population random sample 








4. 
INTERPRET 
results 
for population 


3: 
ANALYSE 
data 
from sample 






The modified modelling diagram (Figure 2 from Unit 6) 


In this section, you are going to consider the first two stages of the 
modified modelling process: clarify question and collect data. 


1.1 The question to be clarified 


The question What factors affect a child’s reading ability? is rather too 
general for us to attempt to answer it straight away. We need to make it 
more explicit. The first step is to understand what is meant by reading 
ability. 
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Activity 1 Measuring reading ability 


How would you measure reading ability? Write down two or three 
measures of reading ability of 7- and 8-year-old children that you might 
use. 


There are various different reading tests available to teachers, and they 
normally combine several of the measures mentioned in the solution to 
Activity 1. We shall be using data that have already been collected for us, 
so the measures used have already been defined. 


The next step is to consider the factors that might affect a child’s reading 
ability. 





A reading class 


Activity 2 Factors affecting a child's reading ability 


Write down some factors that you think might affect a child’s ability to 
read. 


The data that we shall use to explore this area will not allow exploration 
of all these factors. Therefore the data have to be examined before a 
decision can be made as to which factors can be explored and what 
questions can be asked. 


1.2 The data to be used 


We shall be looking at the population of British children aged 7 and 8. 
The sample we shall use consists of 7- and 8-year-old children of a certain 
group of parents defined as follows. At least one of the child’s parents — a 


‘cohort member’ — was born in a particular week in April 1970, resides in 
Great Britain, and has been part of a long-term study known as the 
British Cohort Study (BCS). There were more than 17000 such people. 


The BCS had its origins in what was called the British Births Survey, CHANGING BRITAIN. 
which was originally designed to examine the social and biological VES 
characteristics of the cohort member’s mother. That study looked at 
neonatal morbidity, and its results were compared with those of a similar, 
earlier study, the National Child Development Study, carried out in 1958. 
‘Neonatal morbidity’ refers to disease of the child (the cohort member) in 
its first month of life. 


Since 1970, the aims of the BCS have broadened considerably. There have 
been eight follow-up surveys, or ‘sweeps’, carried out in 1975, 1980, 1986, 
1996, 1999-2000, 2004-2005, 2008-2009 and 2012. The follow-up surveys 
attempted to trace the original sample and, in the case of the first two 
follow-ups, to include immigrants born in the same week as the original 
sample. Each follow-up survey looked at different areas of the original 








group’s development into adulthood. The one which included reading skills i (7 suki 


of the cohort member’s children, and from which the data we shall be using studies 
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BAS II was, in fact, updated to 
BAS 3 in 2011. 
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have been taken, was the one carried out by the Centre for Longitudinal 
Studies at the Institute of Education, University of London, in 2004—2005. 
At that time, the age of the people under study was 34 years. 


We are therefore going to concentrate on data relating to the reading 
ability of children who were 7 and/or 8 years old in 2004-2005 and whose 
parents were part of the BCS. The 2004-2005 sweep of the BCS provided 
data on 745 children aged 7 or 8 in total. Of these, only 679 were tested 
for their reading ability. It is this sample of 679 that we shall concentrate 
on in this unit. As we progress through the analysis, you will find that we 
shall be using sample sizes smaller than this, because not all the additional 
information needed was provided in the answers to the questionnaire. 
However, the sample sizes concerned will remain pretty large. 


Activity 3 Is it a random sample? 


Write down some reasons why this sample of children can or cannot be 
considered a random sample of the population of 7- and 8-year-old children 
in Great Britain in 2004-2005. 


We shall return to the issue of randomness of the sample in Section 8, but 
for most of the unit, despite our doubts, we shall assume that it is 
acceptable to treat the sample as if it were a random sample. 


We next need to consider what data we have available. Table 1 shows data 
on the first few children in the sample. In the first column is the child’s 
reading ability as scored using a standard reading test called the BAS II 
Word Reading Ability Score, where BAS stands for ‘British Ability Scales’. 
This value will be referred to simply as the child’s ‘reading score’ in this 
unit. The second column gives the child’s age in months. 


The remaining columns of Table 1 are in coded form; that is, they use 
simple numerically coded values to represent attributes of the child in 
place of more complicated wordings, ranges of numbers or exact numerical 
values. For example, the third column shows the gender of the child, coded 
as 1 for a boy, 2 for a girl. The fourth column again relates to age; this 
time whether the child is aged 7 or 8 is recorded. The fifth column, headed 
‘Parental education’, actually shows whether the cohort member’s 
partner/spouse finished full-time education by the age of 16 or at some age 
over 16; it is used here as a measure of the level of education of the child’s 
parents. The sixth column shows the occupation of the child’s father. The 
codes for the values in columns three to six are given beneath the main 
body of the table. 


Table 1 Part of the dataset on reading from BCS 2004-2005 


Reading Age Gender Coded Parental Father’s 
score (months) age education occupation 

106 91 1 1 1 = 
123 95 2 1 1 1 
123 86 2 1 — 1 
110 92 1 1 1 1 
92 90 2 1 2 = 
129 93 2 1 1 = 
118 97 1 2 — 2 
115 107 2 2 1 2 
117 93 2 1 2 = 
134 89 1 1 1 1 
25 85 1 1 — 2 
110 93 2 1 1 1 
172 94 1 1 1 1 
138 90 2 1 = 2 
56 105 1 2 — 1 
136 100 2 2 1 1 
115 90 2 1 1 = 

1 1 1 1 


160 94 


(This data is copyright and owned by the Economic and Social Data Service.) 


‘Gender’, 1: boy; 2: girl. ‘Coded age’, 1: 7 years old; 2: 8 years old. 
‘Parental education’, 1: finished aged 16 or less; 2: finished aged over 16. 
‘Father’s occupation’, 1: managerial, technical, professional and skilled 
non-manual occupations; 2: skilled manual, partly skilled and unskilled 
occupations. 


You will notice that in some cases information is missing in the sample 
data. This is to be expected, because some people either cannot or do not 
wish to answer specific questions in the questionnaire. The missing data 
will just be ignored for now, but we will return to a brief consideration of 
its possible effects in Section 8. 


A number of factors may have an effect on a child’s reading ability. With 
our choice of data, the factors we can consider are child’s age, child’s 
gender, parental education and father’s occupation. 


Consider ‘child’s gender’ first. What precise question can we ask? It must, 
as usual, be about the appropriate population (that of British children 
aged 7-8 in 2004-2005) and not merely the sample. We might ask, Within 
this population, do boys and girls differ in their reading ability? But we 
should be more precise. As in Unit 6, we shall be looking for a difference in 
location, in this case between reading scores of boys and girls. 


1 Clarifying the question 
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In the next subsection we shall be more precise about the particular 
measure of location to use, but for now a reasonably precise question is: 


For British children aged 7-8 in 2004-2005, did boys’ and girls’ ability 
scores differ in location? 


Similar questions can be asked about the other factors. 


For British children aged 7-8 in 2004-2005, did reading scores differ 
in location according to the level of parental education? 


For British children aged 7-8 in 2004-2005, did reading scores differ 
in location according to their father’s occupation? 


The questions can also be made more focused. For instance, consider the 
first question again. Perhaps there is a difference between boys and girls 
aged 7, but no such difference for 8-year-olds. Because of possibilities like 
this, it may be more appropriate to consider the two age groups separately, 
asking 


For British children aged 7 in 2004-2005, did boys’ and girls’ reading 
scores differ in location? 


as well as 


For British children aged 8 in 2004-2005, did boys’ and girls’ reading 
scores differ in location? 


The questions on education and occupation could also be split up 
according to age, in a similar way. 


1.3 Setting up the hypotheses 


We shall try to answer most of these very specific questions by means of 
hypothesis tests. Let us then remind ourselves what is involved by 
referring back to the procedure for the sign test discussed in Unit 6, but 
setting it up in a more formal way. 


We began by making a statement about the population of interest that we 
wished to test. In particular, this was the hypothesis that the population 
median was equal to a specified value, M. The hypothesis that the 
population median is equal to M is known as the null hypothesis. This 
hypothesis is usually denoted by the symbol Hg. Thus the null hypothesis 
in the case of the sign test can be stated precisely in the form 


Ao: Population median = M. 


We then looked at the data to see if there was any evidence that the 
population median did not, in fact, equal M. If there is evidence against 
the truth of the null hypothesis, Ho, then we reject this hypothesis and we 
conclude that there is evidence that the population median is not equal to 
M. That the population median is not equal to M is called the 
alternative hypothesis. An alternative hypothesis is usually denoted by 
the symbol Hı. Thus, if we reject the null hypothesis 


Ao: Population median = M, 
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then we are left with the alternative hypothesis ‘Æ’ is the symbol for ‘is not 
l to’. 
Hı: Population median 4 M, veer ie 


and we say we are rejecting the null hypothesis in favour of the alternative 
hypothesis. 


In Unit 6, a trial in a law court was used as an analogy to hypothesis 
testing. In that context, the null hypothesis is that ‘the defendant is not 
guilty’, while the alternative hypothesis is that ‘the defendant is guilty’. If 
the evidence against the null hypothesis is sufficiently great, then the jury 
should reject that hypothesis in favour of the alternative hypothesis, and 
conclude that the defendant is guilty. 


Returning to the questions concerning children’s reading ability, the first 
step therefore is to set up the appropriate null and alternative hypotheses. 
As you might expect, these correspond to whether or not there is a 
difference in location in reading ability between two groups of children. We 
shall start with one of the questions on gender. 


For British children aged 7 in 2004-2005, did boys’ and girls’ reading 
scores differ in location? 


However, before defining the hypotheses, it is worth thinking again about 
the data. We actually have data on reading scores and gender for 

396 children who are aged 7 (of these, 206 are boys and 190 are girls). 
Printing all 396 scores here would clearly be cumbersome and waste space. 
We can summarise the data as shown in Table 2. 


Table 2 Summary statistics for data on reading scores of 7-year-old 
children 


Sample size Sample mean Sample standard deviation 


Boys 206 109.31 27.671 
Girls 190 113.42 25.464 


(This data is copyright and owned by the Economic and Social Data Service.) 


You may well be wondering why the summary measures are the mean, T, 
and standard deviation, s, and not some other measures of location and 
spread, such as the median, M, and interquartile range, IQR. A minor 
reason is that the mean and standard deviation are commonly used in 
practice, so more people are familiar with them than with other measures. 
The main reason, though, is that © and s can be used to construct a 
reasonably simple test, in a way that M and IQR cannot. 
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+m 


The calculation of the mean was 
discussed in Subsection 1.3 of 
Unit 2 and the calculation of the 
standard deviation was discussed 
in Subsection 3.1 of Unit 3. 
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Activity 4 Calculating a mean and standard deviation 


Because it is some time now since you worked with the sample mean (T) 
and the sample standard deviation (s), here is a reminder of how to 
calculate these summary measures: 


LE 


n 


= 


where ` x is the sum of all the sample values and n is the sample size; and 


s = V variance, 
L-T)? 


n—-1 
(a) Data on the first eighteen 7- and 8-year-olds taken from the BCS 
2004-2005 results were given in Table 1 (in Subsection 1.2). Extract 
from that table the values of the reading scores for all the 7-year-old 
boys. What is the value of n for this small sample? 


where the variance is 


(b) Calculate 7 and s for the reading scores for 7-year-old boys that you 
extracted in part (a). 


Having paused briefly to examine the sample data, we now move on. We 
still need to state the null and alternative hypotheses associated with the 
question 


For British children aged 7 in 2004-2005, did boys’ and girls’ reading 
scores differ in location? 


in their precise forms. The null hypothesis will be 


A: For British children aged 7 in 2004-2005, the mean reading score 
for girls was equal to the mean reading score for boys. 


As you will have noticed, Hp is phrased in terms of the population means 
and not, for example, the population medians. The alternative hypothesis 
is naturally taken to be 


Hı: For British children aged 7 in 2004—2005, the mean reading score 
for girls was not equal to the mean reading score for boys. 


The null and alternative hypotheses for other questions listed at the end of 
the previous subsection are similar. 


The British Ability Scales reading score system gives overall mean test 
scores for different age groups in Great Britain. These overall mean test 
scores are given for quite finely defined age groups, from which the authors 
of this unit have come up with the following means for 7- and 8-year-olds: 
the population mean for 7-year-old children is 96, and for 8-year-olds it is 
116. (Actually, these means come from very large samples of children and 
not the whole population, but in practice we can treat them as population 
means.) So a further appropriate question to ask about the data on 
reading scores for 7-year-old children, for example, is whether they are 
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consistent with a population mean of 96. In other words, we could test the 
following hypotheses: 


Ho: For British children aged 7 in 2004-2005, the mean reading score 
was equal to 96 


A: For British children aged 7 in 2004-2005, the mean reading score 
was not equal to 96. 


In testing hypotheses about population medians in Unit 6, the next step 
was to define a quantity that we could calculate from the data that would 
help us to evaluate the truth or otherwise of the null hypothesis. In the 
law-trial analogy, this is the evidence. In the sign test, this quantity was 
the smaller of the numbers of [+]s and [—]s that the sample contains. (See 
Section 4 of Unit 6.) In general, in hypothesis testing, this quantity is 
called the test statistic. So now we need to find suitable test statistics to 
assess the hypotheses about children’s reading abilities. Since these 
hypotheses are about population means or differences between population 
means, the obvious test statistics would involve sample means or the 
differences between sample means. But, as with the sign test in Unit 6, the 
awkward part involves finding what is called the sampling distribution of 
the test statistic; so in the next section we look again at sampling 
distributions. 


Exercises on Section 1 





Exercise 1 Mean and standard deviation for 8-year-olds +f" 


(a) Extract from Table 1 the values of the reading scores for all the 
8-year-old children in the table. What is the value of n for this small 
sample? 


(b) Calculate 7 and s for the reading scores for 8-year-old children that 
you obtained in part (a). 





Exercise 2 Parental education and occupation +f" 


(a) Extract the values of the reading scores for all the children in Table 1 
whose parent’s age on finishing full-time education was 16 or less and 
whose father’s occupation is managerial, technical, professional or 
skilled non-manual. What is the value of n for this small sample? 


(b) Calculate % and s for the reading scores for the sample of children that 
you obtained in part (a). 





Exercise 3 Null and alternative hypotheses? 


Suggest null and alternative hypotheses for comparing the reading abilities 
of the 8-year-old children according to their gender. 
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Histograms were introduced in 
Subsection 1.5 of the Computer 
Book. 
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2 Sampling distributions revisited 


In Subsection 3.3 of Unit 4, you saw what is meant by the sampling 
distribution of the median of a sample, and what happened to such 
sampling distributions as the sample size increased. We now review these 
ideas, but rather than just repeating exactly what was done before, we 
look at the sampling distribution of the mean as opposed to that of 
the median. 


As in Unit 4, in order to look at these sampling distributions precisely, we 
really need to know all the relevant information about the whole 
population. Nobody has information about the reading ability of all 7- and 
8-year-old children in Great Britain, so we cannot work with data exactly 
like those from the BCS. Instead let us look at a population where we 

do have data on everyone, and investigate sampling distributions using 
that. The population is that of all students taking the examination for the 
Open University module Exploring mathematics (MS221) in a particular 
presentation. There were 1234 students in the presentation chosen, and 
their marks in the examination are displayed in Figure 1. This plot is very 
like a histogram with lines instead of bars. The numbers of students 
achieving each mark from 0 to 100 are given by the heights of the lines 
drawn at each mark. These heights are the same as the areas of the bars 
that would have been used on the histogram. But, in addition, the top 
ends of the lines have been joined together. 


This representation gives a good picture of the shape of the population 
distribution of examination marks of students on MS221 in one 
presentation. 
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Figure 1 Numbers of students obtaining each examination mark in 
MS221 


2 Sampling distributions revisited 


Now, there is a modification that we need to make. What will be 
important later are the proportions of students in the population gaining 
each mark. Thus instead of using the vertical axis to measure the actual 
number of students who obtained each mark in the examination, we want 
the population distribution to be described in terms of the proportion of 
students in the population who obtained each mark. We can do this 
simply by dividing each of the actual numbers represented in Figure 1 by 
the total number of students in the population (1234). Hence, 





1 becomes ~ 0.0008, 


1 
1234 


2 becomes 





2 
~ 0.001 
1234 0.0016, 


3 becomes ~ 0.0024, 





3 
1234 
and so on. 


Activity 5 From number to proportion 


US 


The actual number of students scoring 75 marks in Figure 1 is 21. What == 
proportion of students on MS221 in the presentation in question achieved 
75 marks? 


The result of changing from actual numbers to proportions is shown in 
Figure 2. Notice that Figure 2 looks just the same as Figure 1; only the 
scale on the vertical axis has changed. 
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Figure 2 Proportions of students obtaining each examination mark in 
M§S221 
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100 


However, it is not the characteristics of the population distribution of 
exam marks, above, that we are interested in as such. Our focus is going 
to be on the sampling distributions of means of random samples of exam 
marks taken from this population. This is because we will be interested in 
testing hypotheses about the mean examination mark, such as 


Ho: For students on MS221, the mean examination score 
is equal to 65 


Ay: For students on MS221, the mean examination score 
is not equal to 65 


or (using data from other years) 


Ho: For students on MS221, the mean examination score for the 
current presentation is equal to the mean examination score 
for the previous presentation 

Hı: For students on MS221, the mean examination score for the 
current presentation is not equal to the mean examination score 
score for the previous presentation. 


Now we begin our investigation of the sampling distribution of the mean. 
Consider first all possible random samples of size 2 that we might select 
from the population data of 1234 examination marks. There is a great 
number of possibilities (760 761, to be precise!), and we cannot concisely 
picture all the sample values in every one of these possible samples. 
However, as in Unit 4, we can summarise each sample using a summary 
measure, and then picture these in the form of the sampling distribution of 
that summary measure. This time, as suggested above, we use the sample 
mean as our summary measure. 


Activity © Sample means of samples of size 2 


(a) Find the sample means of each of the following samples of size 2: 
(i) 15,35 (ii) 65,77 (iii) 65,52 (iv) 37, 80. 


(b) The exam marks in the population, and hence in any sample, are all 
integers (whole numbers). Are the sample means of samples of size 2 
necessarily integers? If not, what other kinds of value can these sample 
means take? 


In this way it would be possible to calculate the sample mean for every one 
of the 760 761 possible samples of size 2. Different samples can give the 
same sample mean, as (iii) and (iv) in part (a) of Activity 6 illustrate. The 
sampling distribution records the proportions of all these samples with 
each value of the sample mean. A picture of this is shown in Figure 3. 
This represents the sampling distribution of the mean for samples of size 2 
from the population of exam marks. Here all the possible values of the 
sample mean % are indicated on the horizontal axis, and the vertical lines 
represent the heights of the bars that would be used for a histogram of the 
proportion of samples (out of 760761 possibilities) which have each of 
these values as the sample mean. Notice that there are many more lines in 


2 Sampling distributions revisited 


this diagram than there are in Figure 2. That is because this sample mean 
can take about twice as many values, integers and half-integers, as you saw 
in Activity 6, so the histogram can have twice as many bars. Joining the 
tops of the lines again provides us with a good picture of the shape of the 
distribution. 
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Figure 3 Sampling distribution of the mean for samples of size 2 from 
the population of MS221 exam marks 
Activity 7 Distribution of sample means of size 2 


What are the main features of the distribution of sample means of size 2 
shown in Figure 3? 


Let us now find out, as in Unit 4, what happens to the sampling 
distribution as the sample size increases. Let’s first look at the sampling 
distribution of the mean for samples of size 3. 


Activity 8 Sample means of samples of size 3 


[E 
US 


Find the sample mean of each of the following samples of size 3: 
(a) 10, 20, 45 (b) 82, 24, 33 (c) 52, 61, 73 (d) 78, 64, 46. 


Activity 8 indicates that there are even more possible values of the sample 
mean for samples of size 3 than there are for samples of size 2. This means 
that the vertical lines in the sampling distribution will be even closer 
together. For this reason, we stop plotting the lines and just concentrate 
on the shape of the distribution as indicated by the tops of the lines; we 
obtain the picture of the sampling distribution shown in Figure 4. In fact, 
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the ‘joining’ line shown in Figure 4 is made up of lots of very short lines, 

each one joining two adjacent vertical lines. 
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Figure 4 Sampling distribution of the mean for samples of size 3 from 
the population of MS221 exam marks 


Activity 9 Distributions of sample means of sizes 2 and 3 


How does the distribution of sample means of size 3 shown in Figure 4 
compare with the distribution of sample means of size 2 shown in Figure 3? 


Activity 10 Distributions of sample means of sizes 3 and 5 


Figure 5 shows the distribution of sample means of size 5 from the 
population of MS221 examination marks. How does the distribution shown 
in Figure 5 compare with the distribution of sample means of size 3 shown 
in Figure 4? 
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Figure 5 Sampling distribution of the mean for samples of size 5 
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Activity 11 Distributions of sample means of larger sample sizes 


Figure 6 contains pictures of the sampling distributions of the mean for 
larger sample sizes. Notice that we have not indicated the scale on the 
vertical axes in Figure 6, but it is the same in each case. Describe the 
changes in shape of these distributions, as the sample size n increases. 
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Figure 6 Sampling distributions of the mean for samples of size n 


The common shape of the distributions in Figures 4, 5 and 6 is sometimes 
called a ‘bell shape’. You can use the following picture of Big Ben to 
decide whether or not you agree that these distributions are bell-shaped! 
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Big Ben: bell-shaped? 


Now the interesting thing about the sampling distribution of the mean is 
that it will nearly always be approximately bell-shaped (looking something 
like the above figures), no matter what population distribution is taken as 
the starting point. (The sampling distributions of some other quantities, 
such as the sample median, show similar features.) 





Example 1 Sampling distributions of means based on earnings data 


Figure 7 provides a rough picture of the population distribution of 
earnings of all full-time employees in the UK in 2011. 


Proportion 
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Figure 7 Population distribution of earnings of full-time employees 


This population distribution is very smooth. The smoothness results from 
the fact that the population is extremely large and there are so many 
possible earnings that we can record. This means that the vertical lines 
representing the various adjacent proportions would be so close together 
that we could not distinguish between them and so, effectively, the line 
joining the tops is a smooth curve. The distribution is, however, clearly 
right-skew since it has a long tail to the right. This reflects the fact that 
while most employees earn a moderate to ‘medium’ wage, some employees 
earn considerably more, and a few earn very considerably more again. 


2 Sampling distributions revisited 


Figure 8 contains pictures of the sampling distributions of the mean for 
samples of various sizes from this population distribution. 
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Figure 8 Sampling distributions of means based on earnings data 





Activity 12 Distributions of sample means of earnings data 


Describe the main changes in shape of the sampling distributions in 
Figure 8, as the sample size n increases. 


So, again, we see from Example 1 and Activity 12 that even though the 
population distribution is skew, as the sample size n increases, the 
sampling distribution of the mean becomes more and more symmetric and 


bell-shaped. 


What is surprising, though, is that if the sample size is large enough, the 
sampling distribution of the mean will nearly always be this sort of shape, 
no matter what shape the population distribution is. 


The shape of sampling distributions of the mean 


For most practical purposes, whatever the shape of the original 
population distribution, the sampling distributions of the mean for 
large enough sample sizes are always symmetric and bell-shaped. 


These symmetric bell-shaped distributions that we obtain as sampling 
distributions for large enough values of n are called normal 
distributions. 
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As you will see in Section 3, these distributions have some very interesting 
properties which help us to develop the test statistic that we are working 
towards. 


What is a large enough sample? 


As a rough guide, you can assume that, whatever the population 
distribution, for sample sizes greater than 25, the sampling 
distribution of the mean will always be approximately normal, and in 
practice, we generally assume that it 7s normal. 


In fact, the sampling distribution of the mean will actually be 
approximately normal for sample sizes (much) smaller than n = 25 for 
many population distributions. On the other hand, there are atypical 
population distributions for which the sampling distribution of the mean is 
not (approximately) normal. You will not deal with samples from such 
populations in M140. This allows us to rephrase a previously highlighted 
statement. 


The shape of sampling distributions of the mean, rephrased 


For most practical purposes, whatever the shape of the original 
population distribution, the sampling distributions of the mean for 
large enough sample sizes are always approximately normal. 


Exercises on Section 2 





Exercise 4 Means of samples of size 2 from two small populations 
Consider the following two small populations of values: 


Population A: 10203040 and Population B: 10 38 39 40 


(a) Find the sample mean of each of the six different samples of size 2 that 
you can obtain from Population A. Make a very rough plot of the 
positions of the six sample means along the horizontal axis. 


(b) Repeat what you did in part (a) for Population B. 


(c) Compare the graphs you obtained in parts (a) and (b). Which of the 
two displays a more bell-shaped distribution of sample means? Can 
you think of a reason why this should be so? 





Exercise 5 Change in shape as sample size changes? 


The BCS sample with which we are concerned in this unit comprises a 
total of 679 reading scores (of 7- and 8-year-old children in 2004-2005). We 
will now pretend that this large sample of reading score values is actually 
the entire population of reading score values. Figure 9 contains pictures of 
the sampling distributions of the mean for samples of various sizes from the 
(pseudo-)population distribution of reading scores. Describe the changes in 
shape of these sampling distributions, as the sample size n increases. 


2 Sampling distributions revisited 
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Figure 9 Sampling distributions of the mean as sample size changes 


((a) This data is copyright and owned by the Economic and Social Data Service.) 
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The symbol pu is the lower-case 
Greek letter ‘mu’, pronounced to 
rhyme with ‘new’. 
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3 Normal distributions 


In Section 2 we saw that the sampling distribution of the mean is nearly 
always approximately normal, provided the sample size is sufficiently large. 
In this section we examine some of the properties of normal distributions 
and begin to discover just how important sampling distributions really are. 


But first, we need to introduce some important new terminology. You are 
already familiar with the idea of a sample mean, 7: 
Xox sum of sample values 


r= = - 
n sample size 





In this section we shall also need to refer to the population mean. For a 
population of finite — but very large — size, N, this is calculated in exactly 
the same way, but using all the data values in the population. By 
convention it is labelled u, so that 


Xr x sum of population values 
H => ——=——_—————$ um qx— 
N population size 
Because N is often very large indeed, the population is often actually 
assumed to be of infinite size. For an infinite population, the population 
mean value, u, is the mean of a truly enormous sample — the sample size 
must approach infinity. 


There is a similar distinction between the sample standard deviation, s, 
and the population standard deviation, which is denoted by the 
symbol ø. (This is the lower-case version of the Greek letter ‘sigma’, which 
in upper-case form is `, but there is no connection between the ways that 
these two symbols are used here.) The formulas are 


,__, [Le=D (SESE - EZ Gate Ta)? /n 
pa n—-1 í 


where the summations are over sample values, and 


[EE _ |£- (Sa) /N 
N N : 


where the summations are over population values. 








An important property to note is that o is always a positive number. 





Off duty from thetr work tw statistics 
class u and o take a much needed 
Spring Break tn their native Greece 


3.1 Normal distributions: location and spread 


Normal distributions are important in statistics for two different reasons. 
You met the first of these in Section 2: many sampling distributions of 
summary statistics are approximately normal for large enough samples. 
The other reason is that the distributions of many populations are 
approximately normal. One example is the population of men’s heights 
that you will look at below. In Unit 10 you will see further examples of 
population distributions that are approximately normal. 


Importance of the normal distribution 


Normal distributions are important both as (approximate) population 
distributions, in some cases, and as (approximate) sampling 
distributions, in many more cases. 


The distribution is called the normal distribution because it arises so 
commonly. Normal distributions are also called Gaussian distributions 
after the great German mathematician and scientist C.F. Gauss, who was 
instrumental in their development. They also appear in popular literature 
as the ‘bell curve’. 


3 Normal distributions 
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Carl Friedrich Gauss 


Gauss (1777-1855) was a phenomenal mathematician — one of the 
most productive mathematicians ever. He made exceptional 
contributions in many fields, perhaps number theory most notably, 
but also astronomy, geometry, algebra, geophysics and, amongst 
others, statistics. During the early part of his career he took up the 
challenge of predicting where Ceres would be found. Ceres was a 
dwarf planet that had been observed in 1801 but which then 
disappeared behind the Sun and could not be found when it first 
reappeared. Gauss developed new methods of estimation and 
approximation to locate its position. He later published a monograph 
on the theory of the motion of small planets disturbed by large 
planets, and in this he introduced several important statistical 
concepts, including the normal distribution. It is for this reason that 
the normal distribution is also called the Gaussian distribution, 
though Gauss did not contribute most to the development of its 
properties. (The contribution of Laplace (1749-1827), for example, is 
greater.) 


In this unit, we want to explore certain characteristics of normal 
distributions in order to apply them to sampling distributions. It would be 
possible to do this exploration using the sort of sampling distributions we 
met in the last section. However, the descriptions of what is going on tend 
to look rather complicated, because they involve means of sampling 
distributions of means. To make things clearer, the exploration is therefore 
done in the context of a normally distributed population. 


Each normal distribution is a precise distribution defined by a 
mathematical formula involving the mean and standard deviation. We 
shall not need to use this formula in this module. But despite this 
mathematical precision, in practice the word ‘approximate’ is very 
important above. Real-world populations never have exact normal 
distributions in terms of the mathematical formula; but many are close 
enough to a normal distribution so that it makes sense to treat them as 
having normal distributions, in which case we say they are approximately 
normally distributed. 


Figure 10 provides a picture of the population distribution of the heights of 
all men in Scotland in 2008, based on information given by the Scottish 
Health Survey, 2008. 


3 Normal distributions 
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Figure 10 Population distribution of Scottish men’s heights (in metres) 


This population distribution is very smooth, symmetric and bell-shaped. 
For the rest of this section, we shall assume that the distribution is indeed 
normal. 


The symmetry of the distribution means that the population mean height 
is the value corresponding to the mode (peak) of the distribution: about 
1.75 metres. (In fact, as well as being the mode and the mean, this value is 
also the population median!) This characteristic applies more generally 

so that any normal distribution is symmetric about its mean ju. 


Figure 11 shows normal distributions for different values of the mean u 
and Figure 12 shows normal distributions for different values of the 
standard deviation o. 


111 


Unit 7 Factors affecting reading 























0 17 18 19 20 21 22 23 = eee a a ee 
u = 20 he =a 
(d) (e) 


Figure 11 Normal distributions with different locations 
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112 Figure 12 Normal distributions with different spreads 


The location of a normal distribution on the horizontal axis depends on 
the value of its mean js, as demonstrated by Figure 11. 


As with any distribution, the spread of a normal distribution can be 
measured by the standard deviation of the population, ø. Thus a small 
value of o means that the distribution is tightly clustered about the mean; 
the larger the value of ø, the more spread out the distribution will be — as 
demonstrated by Figure 12. 


Activity 13 What are u and o for this normal distribution? 


Figure 13 shows another normal distribution. By comparing it with 
Figures 11 and 12, can you identify the values of u and ø for this normal 
distribution? 








Figure 13 A normal distribution related to those in Figures 11 and 12 


Location and spread of the normal distribution: 1 


The normal distribution has location specified by the population 
mean, u, and spread specified by the population standard 
deviation, o. 


You have now covered the material needed for Subsection 7.1 of 
the Computer Book. 


You have also now covered the material related to Screencast 1 
for Unit 7 (see the M140 website). 


3 Normal distributions 


@ 
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3.2 Normal distributions: relating means, standard 
deviations and plots 


For a normal distribution, almost the whole of the distribution 

(about 99.7%) is contained within plus or minus three standard deviations 
of the mean. For example, the population distribution of Scottish men’s 
heights (in metres) is normal with mean u œ 1.75 and standard deviation 
c ~ 0.07. Thus 30 ~ 0.21, and so almost the whole of the distribution is 
contained within plus or minus 0.21 metres of the mean 1.75 metres 

(i.e. between 1.75 — 0.21 = 1.54 metres and 1.75 + 0.21 = 1.96 metres). 
You can check for yourself in Figure 14 — which is an annotated copy of 
Figure 10 — that this is indeed the case. 
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Figure 14 Annotated population distribution of Scottish men’s heights 


(Similar percentages are known for all other numbers of standard 
deviations; for instance, 95.4% of the distribution is contained within plus 
or minus two standard deviations, and 68.3% within plus or minus one 
standard deviation.) 


Location and spread of the normal distribution: 2 


The normal distribution has its mode at u, and almost the whole of 
the normal distribution is contained between u — 30 and u + 30. 


The links between the graph of a normal distribution and its mean and 
standard deviation suggest that a picture of the distribution can be used 
to obtain approximate values for its mean and standard deviation. 
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Example 2 Approximate values for the mean and standard deviation 


The population distribution of a certain variable x is known to be normal. 
This distribution is pictured in Figure 15. 





Figure 15 A normal distribution 


The mode of this normal distribution occurs at about « = 5. This means 
that the population mean must be approximately equal to 5. So, u ~ 5. 
We say approximately equal because u may not be exactly equal to 5. It 
could be 5.1 or 4.9; it is impossible to give an exact value here. 








Figure 16 Investigating the spread of a normal distribution 


The dashed lines in Figure 16 indicate that almost all of the distribution is 
contained between x = 3.5 and x = 6.5 (i.e. within 5+ 1.5). This means 
that 30 ~ 1.5, so o œ 0.5. 
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In summary, the normal distribution plotted in Figure 15 is approximately 
the normal distribution with mean u = 5 and standard deviation o = 0.5. 





Activity 14 Approximate values for the mean and standard deviation 


For each of the normal distributions shown in the parts of this activity, 
find approximate values for the mean and standard deviation, using the 
method described above. 


(a) 








Figure 17 Another normal distribution 


(b) 
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Figure 18 Yet another normal distribution 
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Figure 19 And yet one more normal distribution 


Conversely, knowing the mean and standard deviation of a normal 
distribution enables us to make a rough sketch of the distribution. Any 
sketch of a normal distribution will show a symmetric and bell-shaped 
curve. More specifically, the distribution must be symmetric about the 
mean. In addition, almost the whole of the distribution must be contained or 
within plus or minus three standard deviations of the mean. 


abnormally normal 





normally abnormal? 


agar] 


Example 3 Sketching a normal distribution 


The normal distribution of a variable x has mean u = 15 and standard 
deviation ø = 3. To sketch this distribution, draw a symmetric, 
bell-shaped curve centred on the value of u, which in this case is 15. The The 1997 music CD 
standard deviation is ø = 3, so that 30 = 9. We therefore know that just ‘abnormally normal or 
about all the distribution is contained within 15+ 9 (i.e. lies between normally abnormal?’ by the 
15 — 9 = 6 and 154 9 = 24). A sketch of the distribution can therefore be band ‘standard deviation’. 
drawn and should resemble Figure 20. 
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Figure 20 The normal distribution with u = 15 and o = 3 





The scale that is used for the horizontal axis certainly affects the shape of 
the normal distribution, as demonstrated by Figure 21. The important 
thing, though, is that the information conveyed by the sketch remains 
exactly the same. 





Figure 21 The same normal distribution plotted on different horizontal 
scales 


Also, for the aspects we are investigating, the height of the distribution 
does not really matter; all the information we require about the 
relationship between the distribution and its mean and standard deviation 
is provided by the scale on the horizontal axis. For this reason there is no 
need to bother with a vertical scale at all. 


3 Normal distributions 


Activity 15 Sketching a normal distribution 


Sketch the following distributions: 


e The normal distribution of a variable x with mean 1000 and standard 
deviation 100. 


e The normal distribution of a variable x with mean 2 and standard 
deviation 0.25. 


Activity 15 demonstrates that it always makes sense to think of the 
horizontal axis of a normal distribution in terms of the number of standard 
deviations of the variable away from the mean. This is illustrated in 
Figure 22, which is an important picture in understanding the normal 
distribution. Notice how the horizontal scale is marked off using u and ø. 








u- 30 m 20 u= M uto +20 +30 
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Figure 22 The normal distribution with its scale marked in terms of u 
and o 


You have now covered the material related to Screencast 2 for D 
Unit 7 (see the M140 website). — 
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3.3 The standard normal distribution 


We can go one step further than that represented by Figure 22 
(Subsection 3.2) and think of all normal distributions in terms of one 
special normal distribution. This special normal distribution has mean 
zero and standard deviation one, and is called the standard normal 
distribution. It looks like Figure 23. Figure 23, in turn, looks like 
Figure 22 with u and ø in the labels on the horizontal axis replaced by 0 
and 1, respectively: so, u — 30 has become 0 — (3 x 1) = —3, u — 20 has 
become 0 — (2 x 1) = —2, and so on. 








Figure 23 The standard normal distribution 


The standard normal distribution 


The standard normal distribution is the particular normal 
distribution that has mean u = 0 and standard deviation o = 1. 


It turns out that we can transform all normal distributions to the standard 
normal distribution. 
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Example 4 Transforming to the standard normal distribution 


The normal distribution of a variable x with mean u = 10 and standard 
deviation g = 2 is illustrated in Figure 24. 








Figure 24 The normal distribution with u = 10 and o = 2 


First, we can shift the whole of the distribution to the left so that the 
mode occurs at zero just by subtracting 10 from each value of x. This is 
shown in Figure 25. It changes the location of the distribution but leaves 
the spread unchanged. 









2 
Subtract 10 from 
each value of x 











Figure 25 Shifting the distribution of x 


The dashed curve in Figure 25 is now a new normal distribution with 
mean zero and standard deviation 2. This new distribution is the 
distribution of the variable v, say, where v = x — 10. The normal 
distribution of v differs from the standard normal distribution only by 
having standard deviation 2 rather than 1. However, if we now think of the 
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horizontal axis in terms of the number of standard deviations of v away 
from the mean, then we obtain Figure 26. 
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Figure 26 The normal distribution of v = x — 10 with mean 0 and 
standard deviation 2 


Then, dividing every value of v by the standard deviation 2 gives the 
distribution of v/2. This distribution, shown in Figure 27, is the standard 
normal distribution, as required. 
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Figure 27 The normal distribution of v/2 with mean 0 and standard 
deviation 1 


We have shown that if the variable x has a normal distribution with 
mean 10 and standard deviation 2, then the variable v/2 = (a — 10)/2 has 
the standard normal distribution. 





Example 4 is a specific example of the following general result. If we start 
with a normal distribution for x, with mean yz and standard deviation øg, 
then: 


3 Normal distributions 


e By subtracting u from each value of x we obtain the distribution of 
v = x — u. This distribution is normal with mean zero and standard 
deviation o. 


e By then dividing each value of v by o we obtain the variable z = v/o, 
which has the standard normal distribution. 
Combining the formulas for z and v, we find that 
ye 
or 
Transforming a normal distribution to the standard normal 
distribution 


If a variable x has a normal distribution with mean p and standard 
deviation o, then the variable 
w= 
g= nm 
@ 
has the standard normal distribution. 


Activity 16 Transforming some particular normal distributions 


For the normal distributions with the following values of and o, write 
down the appropriate formula to transform the variable x to the variable z 
that follows the standard normal distribution. 


(a) p=10,¢=2 (b) = 100, ¢ = 20 u= l = Ol 


Activity 17 Transforming the distribution of Scottish men's heights 


(a) Assume that the population distribution of Scottish men’s heights h = 
(in metres) is normal with mean u = 1.75 and standard deviation 
o = 0.07. Write down the formula for z which transforms each value of 
the variable h to the number of standard deviations from its mean. 


(b) Calculate the value of z corresponding to each of the following values 
of h (in metres). In each case, interpret your answer by completing a 
sentence of the form ‘So a height of *** metres is *** standard 
deviations *** the mean height of *** metres’. 


h=1.96; h=1.61; h=1.785. 
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Importance of the standard normal distribution 


The development in this subsection implies that by describing every 
normal distribution in terms of z, the number of standard deviations 
by which the variable differs from its mean, we can think of all 
normal distributions in terms of just one distribution: the standard 
normal distribution. 





Wall space increased at the Statistics Art Gallery 
when it became clear that only one picture was 
required in the ‘normal distribution collection. 


You have now covered the material needed for Subsection 7.2 of 
the Computer Book. 


You have also now covered the material related to Screencast 3 
for Unit 7 (see the M140 website). 
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3 Normal distributions 


Exercises on Section 3 





Exercise 6 Approximating the mean and standard deviation 


Find approximate values for the mean and standard deviation of the = 
normal distribution shown below. 








0 10 20 aE 


Figure 28 Yet again, a normal distribution 





Exercise 7 Approximating another mean and standard deviation +7 


Find approximate values for the mean and standard deviation of the 
normal distribution shown below. 








=o = 


5 


Figure 29 And one more time, another normal distribution 





Exercise 8 Sketching a normal distribution 


The normal distribution of a variable z has mean —1 and standard 
deviation 1. Sketch the distribution. 
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Exercise 9 Sketching another normal distribution 


The normal distribution of a variable x has mean 4 and standard 
deviation 4. Sketch the distribution. 





Exercise 10 Obtaining z for a normal distribution 


Write down the appropriate formula to transform the variable x to the 
variable z that follows the standard normal distribution when 


(a) x has the normal distribution with mean 6 and standard deviation 3.3; 


(b) x has the normal distribution with mean —6 and standard deviation 2. 





Exercise 11 Calculating z from x 


Assume that x follows the normal distribution with mean u = 2 and 
standard deviation ø = 10. Write down the appropriate formula for z 
which transforms the variable x to the number of standard deviations from 
its mean. Calculate the value of z corresponding to x = 3. 





Exercise 12 Calculating z from x for another normal distribution 


Assume that x follows the normal distribution with mean u = —1 and 
standard deviation o = 0.5. Write down the appropriate formula for z 
which transforms the variable x to the number of standard deviations from 
its mean. Calculate the value of z corresponding to x = 0. 





4 Sampling distributions re-revisited 


We now take a closer look at the sampling distributions of the sample 
mean that you met in Section 2. As we said there, provided the sample 
size is sufficiently large (roughly speaking, greater than 25), these sampling 
distributions are approximately normal. Thus the ideas discussed in 
Section 3, which apply to all normal distributions, apply (approximately) 
to these sampling distributions as well. These ideas will enable us to find a 
suitable test statistic to use for testing some of the hypotheses we are 
interested in for the BCS survey. 















WHAT Do You 
MEAN MY 
WEIGHT SNT 






IT IS IF t 
LIE DOWN 




















4 Sampling distributions re-revisited 


We begin by examining the relationship between sampling distributions of 
the mean and the original population distribution in a little more detail. 


Activity 18 Means of distributions of sample means 


(a) Consider again the population distribution of MS221 examination 
marks which you met in Section 2. In fact, this population distribution 
has mean u = 66 and standard deviation ø = 22. Figure 30 shows the 
sampling distributions of the mean for various sample sizes. (Figure 30 
is similar to Figure 6 but for some different values of n.) 


What do you notice about the means of these sampling distributions 
compared with the population mean? 


Proportion 
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m= 25 
= 
.Q 
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m= 50 


Proportion 
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m = N00 


Figure 30 Sampling distributions of the mean for samples of size n from 
the population of exam marks 


(b) Consider again the population distribution of full-time employees’ 
earnings which you met in Example 1, in Section 2. This population 
distribution has mean u = 491 and standard deviation o = 283 (in £). 
Figure 31 shows again the sampling distributions of the mean for 
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various sample sizes. (Figure 31 is similar to Figure 8 but for different 


values of n.) 


What do you notice about the means of these sampling distributions 
compared with the population mean? 


Proportion 


Proportion 


Proportion 











500 1000 1500 mr 
m = 2 

500 1000 1500 mr 
m = 

500 1000 1500 mir 
n = 100 


Figure 31 Sampling distributions of the mean for samples of various sizes 
from the population of employees’ earnings 


The conclusions of Activity 18 hold more generally so that whatever the 
population distribution (no matter what shape) and whatever the sample 
size (no matter how small), the mean of the sampling distribution is 
always equal to the population mean p. 


Now let us take a closer look at the spread of the sampling distributions. 
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Activity 19 Standard deviations of distributions of sample means (1) 


(a) Consider again the sampling distributions of the mean for MS221 
examination marks that are shown in Figure 30. What do you notice 
about the standard deviations of these sampling distributions? 


(b) Consider again the sampling distributions of the mean for full-time 
employees’ earnings that are shown in Figure 31. What do you notice 
about the standard deviations of these sampling distributions? 


In fact it can be shown that for population standard deviation o, the 
standard deviation of the sampling distribution of the mean for samples of 


size n is o/y/n. 


Activity 20 Standard deviations of distributions of sample means (2) 
Ea 


The population distribution of examination marks has standard deviation = 
g = 22. Use the formula to find the standard deviation of the sampling 
distribution of the mean for samples of size 


(a) 25; (b) 50; (c) 100. 


Both the formula o/yn and the calculations in Activity 20 confirm that 
the standard deviation of the sampling distribution of the mean does 
decrease as n increases, as was suggested in Activity 19. What is not so 
clear, and is perhaps unexpected, is the precise way in which the standard 
deviation of the sampling distribution of the mean depends on n — through 
its square root. 


The expression ‘standard deviation of the sampling distribution of the 


mean’ is a bit of a mouthful. It is often referred to as the standard error The terminology ‘standard 


of the mean for samples of size n, or sometimes just the standard error error’ is related to the notion of 


for short, in which case it can be abbreviated to the symbol SE. Using this 
abbreviation, we obtain the formula SE = ø/yn, which is easier to 
remember. 


The above result holds generally for all sampling distributions, no matter 
what the population distribution and no matter what sample size is 
involved. So there is a very precise relationship between sampling 
distributions and the population distribution. It can be summarised as 
follows. 


‘sampling error’, which you met 
in Subsection 4.1 of Unit 4. 
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Mean and standard deviation of the sampling distribution of 
the mean 


e The mean of the sampling distribution is equal to u, the 
population mean. 

e The standard deviation of the sampling distribution is called the 
standard error of the mean. It is given by 


g 
SE = —— 
vn 
where n is the sample size and o is the population standard 
deviation. 


It seems you've calculated 
the standard deviation of the mean 
when the question asked for the population 
standard deviation. We see this a Lot. 


it’s a standard error. 





The relationship between sampling distributions and the population 
distribution is particularly useful when the sample size is large and the 
sampling distribution is approximately normal. In practice, we usually 
have very little information about the population distribution itself. 
Indeed we often have only a sample of data on which to base our analysis; 
there is no other information about the population. Yet many techniques 
of statistical inference require us to make some assumptions about the 
population distribution. 


The advantage of working with large samples is that, no matter what 
shape the population distribution is, the sampling distribution of the mean 
for samples of size n will always be more or less normal. Moreover, we 


4 Sampling distributions re-revisited 


know that the mean of this sampling distribution is equal to the 
population mean, u, and the standard deviation is the standard error, 
given by SE = a/,/n, where ø is the population standard deviation. This 
is summarised below. 


Approximate normality of the sampling distribution of the 
mean 


If n is large, no matter what shape the population distribution is, the 
sampling distribution of the mean for samples of size n will be 
approximately normal with mean equal to the population mean, p, 
and standard deviation equal to the standard error, SE = a/,/n. 


(This important result is often called the central limit theorem.) 


Activity 21 Approximate distribution of ball bearing diameters + 


The population distribution of the diameters of ball bearings produced by 


a particular manufacturer has mean pp = 2mm and standard deviation 

c = 0.01 mm. Find the standard deviation of the sampling distribution of 
the mean for samples of 25 such ball bearings. Hence give the approximate 
distribution of the mean diameter of ball bearings in samples of size 25. 


What this implies is that we can base our analysis on the relationship 
between the sample data and the sampling distribution of the mean. Thus 
we infer back from the evidence provided by the sample data to the 
sampling distribution. Then our knowledge of the links between this 
sampling distribution and the population distribution allows us to draw 
conclusions about the population mean. This is a very important strategy 
in statistics. 


The new hypothesis test, the z-test, is based on just this principle and will 
be fully discussed in Section 5. As you now know, the sampling 
distribution of the mean, %, for large samples of size n is approximately 
normal with mean p and standard deviation SE = a/,/n. As with any 
normal distribution, we can transform this normal sampling distribution 
into the standard normal distribution. This means that the distribution of 
the variable 

T-—u 

“SE 

is the standard normal distribution (with mean zero and standard 
deviation one). There is a strong connection between this result and the 
z-test to follow. 


You have now covered the material related to Screencast 4 for E 
Unit 7 (see the M140 website). _ 
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TS 


Unless we need to distinguish 
the one-sample z-test from the 
two-sample z-test that will be 


developed in Section 6, we often 


omit the phrase ‘one-sample’. 


One-sided alternative hypotheses 


will be discussed in Unit 10. 
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Exercises on Section 4 





Exercise 13 Standard deviations of the mean as sample size changes 


The population distribution of full-time employees’ earnings has standard 
deviation g = 283. Find the standard deviation of the sampling 
distribution of the mean for samples of size 


(a) 9; (b) 25; (c) 100. 





Exercise 14 Standard deviations of another mean 


The population distribution of a certain quantity has standard deviation 
o = 3.6. Find the standard deviation of the sampling distribution of the 
mean for samples of size 


(a) 4; (b) 19; (c) 300. 





Exercise 15 Standard deviation of the average content of water bottles 


The population distribution of the amount of water contained in a 
nominally one-litre bottle from a certain manufacturer has mean 

u = 1.01 litres and standard deviation ø = 0.01 litres. Find the standard 
deviation of the sampling distribution of the mean for samples of 40 such 
bottles. Hence give the approximate distribution of the mean amount of 
water contained in samples of 40 one-litre bottles from this manufacturer. 





5 The one-sample z-test 


In this section we shall develop a new hypothesis test, the one-sample 
z-test. The hypotheses are concerned with the mean, u, of the population 
from which the sample is selected. We shall suppose that a particular 
value, say A, is of special interest as a potential value for u. The null 
hypothesis is 


Ao: w= A, 
and the alternative hypothesis is 
Ay: H + A. 


Alternative hypotheses of this form are often called two-sided 
alternative hypotheses. This is because they include both u < A and 
>A. 


The above is the first of the four stages of hypothesis testing that you were 
introduced to at the start of Section 4 of Unit 6. In abbreviated form, 
these are: 


1. Set up the hypotheses that we wish to test. 


2. Determine the sampling distribution of a test statistic under the 
assumption that the null hypothesis is true. 


3. Ascertain how unlikely the observed value of the test statistic is on the 
basis of the sampling distribution. 


4. If the test statistic turns out to have a very unlikely value, then either: 
e avery unusual event has happened, or 


e the sample has provided evidence against the correctness of the 
null hypothesis. 


To develop ideas in the current context, we first consider the simpler case 
where the population standard deviation is assumed to be known, and in 
Subsection 5.2 we consider the more realistic case where it is unknown. 
The tests that are developed make use of the results presented in Section 4 
about the sampling distribution of the sample mean. 


5.1 The z-test with the standard deviation 
assumed to be known 


To describe the z-test we will use a simple (constructed) example. 





Example 5 Has a new method of teaching made a difference? 


For many years a teacher has been using the same method of teaching 
children to read. The scores the children obtain on a reading test have a 
mean of 54.6 and a standard deviation of 8.3. These values will be taken to 
be the population mean and the population standard deviation under the 
old method of teaching. The teacher tries a new method with her current 
class of 34 children, and their average score on the reading test is 58.1. She 
wants to test whether random variation underlies the difference between 
the average of this class (58.1) and the long-term average of previous 
classes (54.6), or whether there is a genuine difference. 


The null and alternative hypotheses are: 


Ho: The old method and new method of teaching children to read 
are equally effective. 

Hı: The old method and new method of teaching children to read 
differ in their effectiveness. 


If u denotes the mean reading score of children taught by the new method, 
we can recast these hypotheses as 


Ho: u = 54.6 
Ay: u A 54.6. 
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The sample mean, 7, is based on the performances of n = 34 children. 
Hence, its sampling distribution is approximately normal, as a sample size 
of 34 is quite large. Moreover, as shown in Section 4: 


e the mean of the sampling distribution of % is equal to u 


e =the standard deviation of the sampling distribution of Z (i.e. the 
standard error of Z) is equal to a/\/n. 


Now, to perform a hypothesis test based on %, the sampling distribution 
under which we calculate probabilities is the sampling distribution of z 
assuming that the null hypothesis, Ho, is true. 


In the present case, if Ho is true, then u = 54.6 and the distribution of T is 
approximately normal with mean 54.6 and standard deviation a/,/n. We 
know that n equals 34 but need to know the value of ø. For this example, 
we shall assume that the population standard deviation of scores with the 
new method is the same as with the old method, so ø = 8.3. All told, 


A=546, 7=58.1, n=34, o0=8.3. 


Now, from the end of Section 4, if the sampling distribution of Z is 
approximately normal with mean u and standard deviation SE = a/,/n, 
then the distribution of the variable 
Ey 
SE 

is (approximately) the standard normal distribution (with mean zero and 
standard deviation one). Thus, if Ho is true, so that u = A = 54.6, the 
distribution of the variable 
T — 54.6 

SE 
is (approximately) the standard normal distribution. 





The variable z is the test statistic for the z-test. Its numerical value in this 
example is 


Z-A 58.1-54.6 


2S x246. 
SE 8.3//34 


= 





The main result that we have obtained so far is summarised in the 
following box. 


Test statistic and its sampling distribution when Ho is true 
and o is assumed known 
For a one-sample z-test, when Ho: u = A is true, the test statistic, 
T-A 
2 = = 


SE 
follows (approximately) the standard normal distribution, where 


SE = o/ Vm. 
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Activity 22 Value of z 


Ta 


+ 
xE 
Calculate the value of the test statistic z for the test of = 
Ho: u = 120 
Hı: u #120, 


when n = 100, x = 112, and o = 15. 


Critical values and critical regions 


If the null hypothesis, Ho, is true, then z should follow the standard 
normal distribution. This distribution has a mean of 0, so if the value of z 
given by our data was very large in size (positive or negative), it would 
suggest that Ho is false. The idea, then, is to reject Ho if the observed 
value of z is ‘too extreme’ and therefore unlikely. Notice that ‘too extreme’ 
covers both large positive values and large negative values, in line with H4, 
which specifies u # A, ‘in either direction’ away from A. If we cannot 
believe that the observed z is an observation from a standard normal 
distribution, then we cannot believe Ho. 


We have calculated the value of the test statistic z that is given by our 
data. Suppose now that the test is to be performed at the 5% significance 
level. As discussed in Subsection 4.1 of Unit 6, Ho will be rejected at the 
5% significance level if z is in the most extreme 5% of values under the 
sampling distribution that applies if Ho is true. This ‘most extreme’ region 
is the critical region of the test. (In this case it is the critical region at 
the 5% significance level.) Because of the discussion in the previous 
paragraph, the critical region consists of two parts: one part comprises the 
most extremely high 2.5% of values under the standard normal 
distribution, and the other part comprises the most extremely low 2.5% of 
values under the standard normal distribution. 


The values defining the ‘inner ends’ of the critical region are the critical 
values. The critical values for the z-test at the 5% significance level are 
1.96 and —1.96. Figure 32 shows the critical values and critical region 
pictorially. 
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Probability that z 
lies in here is 0.025 






Probability that z 
lies in here is 0.025 








Critical region at the 
5% significance level 






Figure 32 The standard normal distribution with the critical region and 
critical values (1.96 and —1.96) shown for a test at the 5% significance level 


Instead of using the 5% significance level for the hypothesis test, we might 
want to perform the test at the more stringent 1% significance level. To do 
this, all that changes is the values of the critical values and hence the 
critical region. The critical values become 2.58 and —2.58, and the critical 
region is rather smaller: see Figure 33. 











Probability that z 
lies in here is 0.005 






Probability that z 
lies in here is 0.005 








Critical region at the 
1% significance level 





Figure 33 The standard normal distribution with the critical region and 
critical values (2.58 and —2.58) shown for a test at the 1% significance level 


The procedure to be followed to complete the z-test is as follows. 
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Completing the z-test 
If z > 2.58 or z < —2.58, reject Ho at the 1% significance level. 


If 1.96 < z < 2.58 or —2.58 < z < —1.96, reject Ho at the 5% 
significance level but not at the 1% significance level. 


If —1.96 < z < 1.96, do not reject Hp at the 5% significance level. 


Activity 23 Where is z in marginal circumstances? 


On a sketch of the standard normal distribution, show where the value of z 
must lie in the marginal case where Ho is rejected at the 5% significance 
level but not at the 1% significance level. 


As noted in Subsection 5.1 of Unit 6, we conclude that there is strong We will use ‘strong’ whenever we 
evidence against the null hypothesis if we reject Ho at the 1% significance reject Ho at the 1% significance 
level. If we reject it at the 5% significance level but not the 1% level, we level in this unit; the evidence 


i ; : might in fact be ‘very strong’, 
conclude that there is moderate (but not strong) evidence against the null De R Hy at 


hypothesis. If we do not reject Ho at the 5% significance level, we have, in the 0.1% level. 
the words of Subsection 5.1 of Unit 6, either ‘little’ or ‘weak’ evidence 
against Ho. 





Example 6 Completing the z-test started in Example 5 


In Example 5, the test statistic (the data z-value) takes the value 2.46. 
This value exceeds 1.96 but not 2.58. We conclude that there is moderate 
evidence that the old and new methods are not equally effective at 
teaching children to read. 


As the new method gave an average score of 58.1, while the average under 
the old method was 54.6, and a higher test score means better reading 
ability, there is moderate evidence that the new method is better than the 
old method. 





Activity 24 A z-test in manufacturing 


Ss 


A firm is engaged in putting finishes on work surfaces for kitchen = 
manufacturers. Previously, the work was done in very large batches, so the 

time spent setting up the machine did not affect production too much. 

However, with a change in the pattern of demand the batch size has had to 

be considerably reduced, so the time spent setting the machine to different 
specifications is becoming more important. 


Last year the manufacturing manager found that the machine setting had 
been changed very many times and the mean time taken for a change was 
26.1 minutes. The operators suggested a way in which the set-up time 
might be reduced, but the manager was unconvinced and feared that the 
set-up time might actually be increased. Nevertheless, it was agreed to try 
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Compare formula for ESE with 


SE = o/yn. 
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out this new method for two weeks. A z-test would then be performed to 
examine whether or not the mean time for setting up under the new 
method differs from the mean time taken last year. 


In the two-week testing period, the machine was reset on 53 occasions, 
taking a mean time of 20.9 minutes. 


(a) What are the appropriate null and alternative hypotheses? 
(b) Give the values of A, 7 and n. 


(c) Assume that the standard deviation, ø, equals 12.3. Calculate the 
value of the test statistic. 


(d) Is the null hypothesis rejected at the 5% significance level? Is it 
rejected at the 1% significance level? 


(e) What do you conclude from the hypothesis test? 


5.2 The z-test with unknown standard deviation 


In Subsection 5.1, we developed one-sample z-tests under the assumption 
that ø is known. Now a is the standard deviation of the population from 
which the sample data are drawn. Typically its actual value will not be 
known, but if we have a large sample then the sample standard deviation, 
s, provides a good estimate of ø. Moreover, provided the sample size is 
large, the one-sample z-test can be performed with o replaced by s. 
Specifically, we calculate the estimated standard error (ESE) of 7, 


S 
ESE = — 
S F 


and put 


ESE ` 

You might be slightly disquieted by the bald assertion that, for large 
samples, replacing SE by its estimated value ESE makes no difference to 
the (approximate) standard normal distribution of ( — A)/SE. After all, 
ESE is not the correct quantity to divide by; SE is. It is the assumption of 
a large sample that saves the day. In Unit 10 we give tests for small 
samples (t-tests) which take the difference between ESE and SE into 
account. Differences between those tests and z-tests are small when the 
sample size is above about 25. 


At the end of Section 2, it was asserted that the sampling distribution of 
the mean will always be approximately normal for sample sizes greater 
than 25. It was also argued that the sampling distribution of the mean will 
actually be approximately normal for sample sizes (much) smaller than 

n = 25 for many population distributions. In that sense, the notion of 

n = 25 being large enough errs on the ‘careful’ side. When SE is replaced 
by its estimated value (ESE), however, a sample size of 25 is only just 
enough for a z-test to be usable. We will continue to use this ‘rule of 


thumb’, but n = 25 is no longer a ‘generous’ value — many would prefer to 
use z-tests only for samples that are a bit larger than that. 


What is a large enough sample for a z-test? 


As a rough guide you can assume that, whatever the population 
distribution, for sample sizes of at least 25, the z-test is applicable. 





As this jolly logo shows, ESE also stands for ‘Exceptional Student 
Education’ ... an educational program in schools in Citrus County, 
Florida, USA 


The next two boxes lay out the full requirements and procedure for the 
one-sample z-test. They cover both the cases where o is known and where 
it must be estimated. The first box gives the key pieces of information that 
you should pick out for a z-test when you are reading details about a 
survey or experiment. 


Key values for a one-sample z-test 

The information you need to know for a one-sample z-test is: 

e the hypothesised population mean (A) under the null hypothesis 
e the sample mean (Z) 

e the sample size (n) 


e the population standard deviation (c), or a good estimate of ø. 


5 The one-sample z-test 
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Procedure: the one-sample z-test 
1. Set up the null and alternative hypotheses, 
Eo j= Al 
Hı: p#A, 
where p is the population mean. 
2. Calculate the test statistic, z: 


e If the population standard deviation (ø) is known, 





Z—A 
z= , where SE = 
e Ifc is unknown but the sample size (n) is 25 or more, 
T-A s 
z= an Where ve 


Here 7 is the sample mean and s is the standard deviation of the 
sample. SE is the standard error of the mean and ESE is the 
estimated standard error. 


3. Compare z with the appropriate critical values, which are 1.96 
and —1.96 at the 5% significance level and 2.58 and —2.58 at the 
1% significance level. 


e If z> 2.58 or z < —2.58, then Ho is rejected at the 1% 
significance level. 


e If 1.96 <z < 2.58 or —2.58 < z < —1.96, then Ho is rejected 
at the 5% significance level but not at the 1% significance 
level. 


e If —1.96 < z < 1.96, then Hp is not rejected at the 5% 
significance level. 


4. State the conclusions that can be drawn from the test. 


We are now in a position to start answering some of the questions we 
asked about the BCS survey data in Subsection 1.3. The investigation 
illustrates use of the one-sample z-test when o is unknown. 





Example 7 Reading scores of 7-year-old children in BCS survey 


In a question that was posed at the end of Subsection 1.3, we asked 
whether the sample of children from the BCS survey in 2004-2005 could be 
considered to have come from the population of children for whom the 
British Ability Scales reading score was developed. The overall population 
mean reading scores for British children are taken to be 96 for 7-year-old 
children. We wrote down the following null and alternative hypotheses: 


Ho: For British children aged 7 in 2004-2005, the mean reading score 
is equal to 96 


Hı: For British children aged 7 in 2004—2005, the mean reading score 
is not equal to 96. 


We can recast these hypotheses as 
Ho: u = 96 
Hı: u #96, 
where p is the population mean of the reading scores of all British 


7-year-old children in 2004-2005. The data from the BCS concerning 
7-year-old children are summarised in Table 3. 


Table 3 Further summary statistics for data on reading scores of 
7-year-old children 
Sample size Sample mean Sample standard deviation 


396 111.28 26.668 


(This data is copyright and owned by the Economic and Social Data Service.) 


Although o is unknown, the sample size is considerably greater than 25, so 
the ESE may be used in calculating z. Thus the information required for 
the z-test is: 


A=96, %=111.28, n=396, s = 26.668. 

We can now calculate the test statistic: 
_E-A_F-A_ 111.28-96 
ESE s/n 26.668 /V/396 


Assuming that the null hypothesis, Ho, is true, 11.40 is a value from the 
standard normal distribution. However, 11.40 is much bigger than the 1% 
critical value of 2.58. Hence the z-test clearly rejects Ho at the 1% level. 








~ 11.40. 


We conclude that there is strong evidence that the mean reading score for 
7-year-old children in 2004-2005 is not equal to the overall mean reading 
score for 7-year-old children. At face value, this is a little surprising — there 
seems no obvious factor to cause a difference in reading ability between the 
7-year-old children in the 2004—2005 BCS survey and the 7-year-olds in the 
population of British children for whom the reading test was originally 
developed. Given that the mean reading score for the 7-year-old children 
in the BCS survey is larger than the overall mean reading score, for some 
reason the BCS children seem to have performed rather better than 
expected (on average). 


The following two activities provide you with practice in applying the 
z-test. The first one continues our investigation of the BCS data. It 
concerns the reading scores of 8-year-old children. The second concerns 
some data on earnings. 
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Activity 25 Reading scores of 8-year-old children in BCS survey 
In the BCS investigation, the following results were obtained for 8-year-old 


children. 


Table 4 Summary statistics for data on reading scores of 8-year-old 
children 


Sample size Sample mean Sample standard deviation 

283 126.92 27.711 
(This data is copyright and owned by the Economic and Social Data Service.) 
The overall mean reading score for 8-year-old children is 116. 


Carry out a z-test to investigate whether the sample of 8-year-old children 
was selected from a population whose mean reading score is equal to the 
overall mean score for 8-year-old children. Comment on your result. 


Activity 20 Wages of female employees 


A random sample of 810 female local government clerical officers and 
assistants had a mean wage of £373.40 per week in 2011 with a standard 
deviation of £138.20. The overall mean weekly wage for female employees 
in 2011 was £381.50. (Source: Annual Survey of Hours and Earnings, 
2011.) Investigate whether the mean weekly wage of female local 
government clerical officers and assistants differed from the overall mean 
weekly wage for female employees in 2011. Comment on your result. 


You have now covered the material related to Screencast 5 for 
Unit 7 (see the M140 website). 


Exercises on Section 5 





Exercise 16 Reading scores of 7-year-old girls in BCS survey 


In the BCS investigation, the following results were obtained for 7-year-old 
girls. (These results have been extracted from Table 2 in Subsection 1.3.) 


Table 5 Summary statistics for data on reading scores of 7-year-old girls 
Sample size Sample mean Sample standard deviation 

190 113.42 25.464 
(This data is copyright and owned by the Economic and Social Data Service.) 
The overall mean reading score for 7-year-old children is 96. 


Carry out a z-test to investigate whether the sample of 7-year-old girls was 
selected from a population whose mean reading score is equal to the overall 
mean score for 7-year-old children. Comment on your result. 
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Exercise 17 Weight of pigs 


A random sample of 533 pigs of a certain breed that had been fed a special = 
diet were weighed. They had a mean weight of 81.92 kg with a standard 

deviation of 15.65 kg. The mean weight of this breed of pig when fed the 

standard diet is 80kg. Evaluate the evidence that the special diet changes 

the mean weight of this breed of pig. 


Exercise 18 An exciting exercise: paint drying +f 


A consumer magazine, when comparing various brands of paint, stated 
that the drying time of one particular brand was exactly four hours. The 
manufacturers of that paint were not particularly pleased with this as they 
believed the drying time for their paint was shorter. They organised a trial 
in which the paint was tested by a random sample of 40 customers, all of 
whom were decorating their living rooms. For this sample the mean drying 
time was found to be 3.80 hours and the standard deviation was 0.55 hours. 


(a) Analyse the sample data to test whether the drying time given by the 
consumer magazine is correct. 


(b) What reservations might there be about your conclusion? 





In 2011/12, the internet (including one national newspaper) was 
abuzz with news of the forthcoming inaugural World Watching Paint 
Dry Championships to be held in Stoke-on-Trent in July 2012. 
Competitors were to each be given a one-metre square patch of 
freshly emulsioned wall at which to stare as it slowly dried. There 
were said to be 42 entrants, from the UK, USA, India and Hungary. 
Unfortunately, there is no evidence that the event actually took place. 
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6 The two-sample z-test 


In this section we develop the two-sample z-test, which is used to analyse 
the difference in locations between two populations. There were plenty of 
examples of this raised in the context of the BCS and its data on reading 
scores in Subsections 1.2 and 1.3. One such question posed there was: 


For British children aged 7 in 2004-2005, did boys’ and girls’ reading 
scores differ in location? 


Here, the two populations which we wish to compare in terms of their 
reading abilities are the population of British boys aged 7 in 2004—2005 
and the population of British girls aged 7 in 2004-2005. Another example 
is the question 


For British children aged 7-8 in 2004-2005, did reading scores differ 
in location according to their father’s occupation? 


Here, the two populations which we wish to compare in terms of their 
reading abilities are the population of British children aged 7-8 in 
2004-2005 whose father’s occupation was coded 1 in Table 1 (managerial, 
technical, professional and skilled non-manual occupations) and the 
population of British children aged 7-8 in 2004-2005 whose father’s 
occupation was coded 2 (skilled manual, partly skilled and unskilled 
occupations). 


As in Section 5, comparisons will be made using hypothesis tests 
comparing means, and the two-sample z-test will be appropriate when 
both samples are large. To develop this test we use the reading scores from 
the BCS sample. We examine the first of the above questions: 


For British children aged 7 in 2004-2005, did boys’ and girls’ reading 
scores differ in location? 


The following are appropriate null and alternative hypotheses: 


A: For British children aged 7 in 2004-2005, the mean reading score 
for girls is equal to the mean reading score for boys 


Hı: For British children aged 7 in 2004-2005, the mean reading score 
for girls is not equal to the mean reading score for boys. 


We shall now introduce some symbols that will enable us to express our 
hypotheses more concisely and will also be helpful in explaining a 
theoretical result that we need. We are investigating two populations of 
values: the reading scores of all British 7-year-old girls in 2004—2005 and 
the reading scores of all British 7-year-old boys in 2004-2005. We shall let 
the means of these two populations be jz, and up, and the standard 
deviations be og and op. It is worth noting that the values of these 
quantities cannot be known: not all British 7-year-old girls and boys 
actually took this test in 2004—2005. So there is no way we could actually 
calculate Hg, Hp, Tg and op, but they enable us to make precise statements. 


For a start, we can use jl, and up to write the hypotheses concisely as 


Ho: bg = bp 
Hı: Hg F Hp, 
or, equivalently, as 
Ho: Hg ~m = 0 
Hı: Hg — Hp £ O. 
This last form is the one we shall actually use to derive the test statistic. 


Although we do not know test values for all children, the values for the 
samples of girls and boys in the BCS are known. We shall denote these 
samples’ sizes by ng and np, the sample means by zg and Zp, and the 
sample standard deviations by s, and sp. Their values were set out in 
Table 2 (Subsection 1.3), but we do not need them at the moment. 


As we have expressed our null hypothesis as uw, — Hp = 0, it seems 
intuitively sensible to test the hypothesis by looking at the difference 
between the sample means, Tg — Tp. Before we can develop our hypothesis 
test, we need a theoretical result about the sampling distribution of the 
difference between two sample means. 


You already know, from Section 4, that, because ng and np are large, the 
sampling distribution of Tg is approximately normal with mean pu, and 
standard error og/,/Ng, and similarly that the sampling distribution of zp 
is approximately normal with mean jy, and standard error o}/,/np. We 
may conceive the first of these sampling distributions by thinking of all the 
possible samples of size ng that we could select from the population of 
scores of all 7-year-old girls. We then imagine that we could calculate zg 
for each of these samples and look at their distribution. Similar 
considerations apply to the sampling distribution of Zp. 


Now, think of all the possible means T, of samples of size ng of girls and 
also all the possible means zp of samples of size np of boys. If we select 
just one value of Z, and one value of Tp, we can calculate the difference 
Tg — Zp. Now think of all the possible pairs of values 7, and zp we could 
select, and suppose we calculate Tg — Zp for each of them. Then the 
distribution of all these differences is the sampling distribution of the 
difference between two means. 


We require three results that are known about this sampling distribution. 
First, the mean of the sampling distribution of Tg — Tp is equal to Hg — Hp, 
as you might expect. The second result requires the two samples to be 
independent of each other — here that is clearly the case, as the choice of 
girls was completely separate from the choice of boys. As long as the 
samples are independent, the standard deviation of the sampling 
distribution is given by 

o2 2 


SE=,/-£ +2 
Ng np 


$ 


and this standard deviation is called the standard error of the 
difference between two means. Notice that it is larger than the 
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standard errors of Zg and Tp, which are og/,/Ng and o},/,/Np, respectively. 
This is because we are looking at the difference between two sample 
means; both means can vary, so there is more variation in the difference 
between them. Notice also that the standard error of the difference 
between two means is neither the sum nor the difference of the standard 
errors of the individual means. The next box summarises these results. 


Mean and standard deviation of the sampling distribution of 
the difference between two means 


e The mean of the sampling distribution is equal to Hy — Hp, the 
difference between the population means. 


e The standard deviation of the sampling distribution is called the 
standard error of the difference between two means, and is given 


by 
o2 o2 
SE- N 4 2 
Ng Nb 


where ng and np are the sizes of the samples, and og and op are 
the population standard deviations. 


Furthermore, provided the sample sizes are sufficiently large, the sampling 
distribution of the differences between two sample means is approximately 
normal. This is the third result that we require. 


Approximate normality of the sampling distribution of the 
difference between two means 


If ng and np are large, no matter what shape the population 
distributions, the sampling distribution of the difference between two 
means based on samples of sizes ng and ny will in practice be 
approximately normal. 


From these results, Tg — Tp is approximately normally distributed with 
mean 4 = Hg — Hp and standard deviation ø = SE. Thus, the formula 
given in Subsection 3.3 can be used to transform Tg — Tp to a quantity 
which follows (approximately) the standard normal distribution: 

o xæ—pu _ &g-Zp) — (Ug — Hp) 

ig SE 
Now to obtain our test statistic, we assume that the null hypothesis Ho is 
true, SO Hy — Hp = 0. We still cannot calculate z, as we do not know og 
and op. We deal with this problem exactly as we did in Subsection 5.1, by 
replacing og by sg and op by sp. This leads to the estimated standard 
error of Tg — Zp: 

sz 2 


ESE =u 2 pe, 
Ng Np 


Test statistic and its sampling distribution when H, is true 


For a two-sample z-test, when Ho: Hg — Hp = 0 is true, the test 
statistic, 
2 32 


S 
here ESE = ,/ = + 2 
o Ve Ng g np’ 


Ta = Wp 
z= 





follows (approximately) the standard normal distribution. 


For the one-sample z-test, we used the rule of thumb that the sample size 
had to be at least 25. To justify use of a two-sample z-test, we apply this 
rule of thumb to both samples and require that each sample size should be 
at least 25. 


Since the test statistic above has the standard normal distribution 
(approximately) when the null hypothesis is true, the critical values are 
exactly the same as those in Subsection 5.1 for a one-sample hypothesis 
test. We can reject Ho at the 1% significance level if z > 2.58 or if 

z < —2.58, and we can reject Ho at the 5% significance level if z > 1.96 or 
z < —1.96. Otherwise we cannot reject Ho. 





Example 8 Comparing the mean reading scores of girls and boys 


We are now able to perform the two-sample z-test with which the current 
subsection was introduced. The hypotheses are: 

Ho: Hg = H» 

Ay: Hg £ Hp, 
where jl, is the population mean reading score for 7-year-old girls in 
2004-2005, and up is the population mean reading score for 7-year-old 


boys in 2004-2005. The data on which the test will be based were given as 
Table 2 (Subsection 1.3) and are repeated in Table 6. 


Table 6 Summary statistics for data on reading scores of 7-year-old 
children 


Sample size Sample mean Sample standard deviation 


Boys 206 109.31 27.671 
Girls 190 113.42 25.464 


(This data is copyright and owned by the Economic and Social Data Service.) 
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Key values for a two-sample z-test 


In general, call the two groups A and B. The information you need to 
know for a two-sample z-test is: 

e the sample means (T4 and Fp) 

e the sample sizes (n4 and ng) 


e the population standard deviations (o4 and og), or good 
estimates of them (s4 and sp). 


In this example we are using ‘g’ and ‘b’ to distinguish the two groups, 
rather than A and B. We have: 


Tg = 113.42, Tp = 109.31, n,=190, mp, = 206, 
5g = 25.464, sp = 27.671. 


Both ng = 190 and np, = 206 are greater than 25, so we can assume that 
the z-test is applicable. 


We first calculate the value of ESE, the estimated standard error of 
Tg — Tp: 


2 2 2 2 
n= 4 8b — ,/75-464 | 27.672? aoi. 
ng np 190 206 


Hence the value of the test statistic is 
_ Tg — Tp No 113.42 — 109.31 
= ESE — 2.670 

The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 

—1.96 < 1.54 < 1.96, we cannot reject Ho at the 5% significance level. 

There is little evidence to suggest that the mean reading scores in 

2004-2005 for 7-year-old boys and girls were different. 











~ 1.54. 





The procedure for the two-sample z-test is summarised in the following 
box. 


Procedure: two-sample z-test 


1. Set up the null and alternative hypotheses, 


Ho: ba = Mp 
Hı: p4 F bp, 
where u4 and up are the means of populations A and B, 
respectively. 
2. Calculate the test statistic 
E TEA — FEB 
S 
where the estimated standard error of TA — Tp is 


aa = ,/ 24 + 22. 
na wa 


Here, n4 and ng are the sample sizes of random samples from 
populations A and B respectively, 74 and Zp are the sample 
means, and s4 and spg are the sample standard deviations. 


3. Compare z with the appropriate critical values, which are 1.96 
and —1.96 at the 5% significance level, and 2.58 and —2.58 at the 
1% significance level. 


e Ifz> 2.58 or z < —2.58, then Ho is rejected at the 1% 
significance level. 


e If 1.96 < z < 2.58 or —2.58 < z < —1.96, then Ho is rejected 
at the 5% significance level but not at the 1% significance 
level. 


e If —1.96 < z < 1.96, then Ho is not rejected at the 5% 
significance level. 


4. State the conclusions that can be drawn from the test. 


In the two-sample z-test, it doesn’t actually matter which of the two 
groups of interest you label A and which B. If you swapped the roles of A 
and B over, you would change the sign of z but nothing else. In particular, 
the conclusions of the test would be the same in either case. 
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Activity 27 Mean reading scores of girls and boys at age 8 


— In the BCS investigation, the following results were obtained for 8-year-old 
children. 


Table 7 Summary statistics for data on reading scores of 8-year-old 
children 


Sample size Sample mean Sample standard deviation 


Boys 145 126.38 29.927 
Girls 138 127.49 25.064 


(This data is copyright and owned by the Economic and Social Data Service.) 


Carry out a two-sample z-test to investigate whether the mean reading 
score of 8-year-old girls in 2004—2005 was equal to the mean reading score 
of 8-year-old boys in 2004-2005. Comment on your result. 


You might have noticed something interesting about the results for 
7-year-old and 8-year-old children. For the younger children, the girls’ 
sample mean score was 113.42 — 109.31 = 4.11 more than that for boys, 
whereas for the older children the girls’ sample mean score was 

127.49 — 126.38 = 1.11 higher. One might have thought at first glance that 
there was an interesting effect here: at the younger age, girls are ahead of 
boys in reading ability, but a year later boys seem to be catching up. Not 
so, however: our hypothesis tests showed that in neither case was there 
any evidence of a real difference, or therefore, of any such effect. The 
differences in the samples that we observed can easily have arisen by 
chance. 


Does the level of education of parents have an affect on the reading scores 
of their children? In the next activity you will investigate this in the 
context of the BCS survey. This study classified parental education into 
two categories: those who finished full-time education by age 16 and those 
who continued after 16 (see Table 1, Subsection 1.2). 


+a Activity 28 Mean reading scores according to parental education 


This activity addresses another of the questions raised in Subsection 1.2: 


For British children aged 7-8 in 2004-2005, did reading scores differ 
in location according to the level of their parents’ education? 


Table 8 provides the relevant summary data from the BCS. 


Table 8 Summary statistics for data on reading scores and parental 


education 

Parental education Sample size Sample mean Sample standard deviation 
Ended by age 16 389 116.12 28.775 
Continued after age 16 199 123.15 24.603 


(This data is copyright and owned by the Economic and Social Data Service.) 
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(Note that 679 children were tested for their reading ability, but no 
information was available on when the parents of 91 of the children 
completed their education.) 


Carry out a hypothesis test to investigate whether children whose parents’ 
education continued beyond age 16 scored differently on average on the 
reading test from those children whose parents’ education ended by age 16. 


You might have expected the answer to Activity 28 before analysing the 

data. That is, denoting uç as the mean reading score of children whose 

parental education continued after age 16 and up as the mean reading 

score of children whose parental education ended by age 16, you might 

have thought of doing the following: testing the null hypothesis that 

Lic = Hg With the purpose of seeing whether, as you suspect, uç is actually 

greater than upg, disregarding the possibility that uç could be less than pp. 
Hypothesis tests undertaken when a particular type of inequality between 

the two groups is of interest are the one-sided tests mentioned in a margin 

note at the start of Section 5 and to be looked at briefly in Unit 10. —_—= 


You have now covered the material related to Screencast 6 for 
Unit 7 (see the M140 website). 


Exercises on Section 6 





Exercise 19 Mean reading scores according to fathers’ occupations 


ga 


This exercise concerns another question posed in Subsection 1.2, namely: — 


For British children aged 7-8 in 2004-2005, did reading scores differ 
in location according to their fathers’ occupations? 
Table 9 provides the relevant summary data from the BCS. Note that, as 
in Table 1 (Subsection 1.2), ‘1’ denotes ‘managerial, technical, professional 
and skilled non-manual’ occupations while ‘2’ denotes ‘skilled manual, 
partly skilled and unskilled’ occupations. 


Table 9 Summary statistics for data on reading scores and father’s 
occupation 


Father’s occupation Sample size Sample mean Sample standard deviation 


1 316 120.55 24.221 
2 203 117.17 30.085 


(This data is copyright and owned by the Economic and Social Data Service.) 
(No information was available on father’s occupation for 160 individuals.) 


Carry out a two-sample hypothesis test to investigate whether the mean 
score of children whose father had an occupation coded 1 differs from that 
of children whose father had an occupation coded 2. Comment on your 
result. 
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Exercise 20 Calcium for babies 


This exercise is related to an investigation of the effect of vitamin D 
supplementation for the prevention of low levels of calcium in newborn 
babies. The data given in Table 10 come from a clinical trial in which a 
sample of babies who were breast-fed were compared with a sample of 
babies who were bottle-fed: the measured quantity was the level of calcium 
in the baby’s blood (‘serum calcium’) at 1 week of age. 


Table 10 Summary statistics for data on serum calcium for week-old 


babies 

Sample size Sample mean Sample standard deviation 
Breast-fed 64 2.45 0.292 
Bottle-fed 169 2.30 0.274 


(Source: Cockburn et al. (1980) ‘Maternal vitamin D intake and mineral 
metabolism in mothers and their newborn infants’, British Medical Journal, 
vol. 281, pp. 11-14) 


Carry out a two-sample z-test to investigate whether the mean serum 
calcium level of week-old babies was the same whether they were 
breast-fed or bottle-fed. 


Exercise 21 Peak flow rate of lungs 


The peak flow rate is a measure of how well a person’s lungs are 
functioning. It is the maximum rate in litres per minute at which air can 
be expelled through a peak flow meter. In an investigation of the 
possibility that chronic bronchitis, although a disease of adult life, starts in 
childhood, the peak flow rates of a large number of school children without 
persistent coughs were measured. Amongst other details recorded were 
whether the child lived in an urban or a rural area. Data for urban and 
rural areas are summarised in Table 11. Use a two-sample z-test to 
examine whether the average peak flow rate of children differs in these two 
groups. 


Table 11 Peak flow rates for children without persistent coughs 
Sample size Sample mean Sample standard deviation 


Urban 485 226 52 
Rural 637 231 53 


(Source: unpublished data collected by Professor J.R.T. Colley, University of 
Bristol) 
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7 Computer work: one-sample z-tests 


In this section you will use Minitab to perform one-sample z-tests. These 
are similar to the tests you have performed earlier in this unit, except that 
Minitab gives the results of hypothesis tests in terms of p-values, while in 
earlier sections we have only considered specific significance levels (5% and 
1% significance levels). The use of p-values with sign tests was explained in 
Unit 6. Their use with z-tests is identical, but is described explicitly in the 
Computer Book. 


You should now turn to the Computer Book and work through Chapter 7. 
The chapter starts with the interactive computer resources connected with 
Section 3 of this unit; you should do them now if you have not already 
done so. You should then do the Minitab work that is contained in the rest 
of Chapter 7. 


8 Conclusions and reservations 


We have answered many of the questions raised in Section 1, and we have 
learned a lot about children’s reading ability and factors affecting it, at 
any rate for British children aged 7 and 8 in 2004-2005. We summarise our 
conclusions below. As usual, though, after coming to such conclusions, we 
should stop and look for reservations that might arise. 


e Are there any problems with the data that might throw doubt on 
conclusions drawn from them? 


e Were appropriate statistical methods used in analysing the data? 


We shall look at both these questions. To address the second question we 
shall discuss when z-tests should be used. We then note limitations on the 
way conclusions are stated and interpreted. 


Conclusions 


We began this unit by asking the general question: 
What factors affect a child’s reading ability? 


In Section 1, we refined this question to produce several more specific 
questions that we could attempt to answer using BCS data. In Sections 5 
and 6, we carried out hypothesis tests that related to these questions. All 
these tests involved hypotheses about the population from which the BCS 
sample was drawn, that of British children aged 7 and 8 in 2004—2005. 


In Example 7 (Subsection 5.2), we found that we could reject the null 
hypothesis that 7-year-old British children in 2004-2005 had the overall 
population mean reading score for 7-year-olds. Similarly, in Activity 25 
(Subsection 5.2), we found that we could reject the null hypothesis that 
8-year-old British children in 2004—2005 had the overall population mean 
reading score for 8-year-olds. (A related result for 7-year-old girls was 
obtained in Exercise 16 in Section 5.) 
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In Example 8 (Section 6), we found that, for 7-year-old children, the null 
hypothesis that the population mean for boys was equal to that for girls 
could not be rejected. In Activity 27 (Section 6), we also found that the 
same was true for 8-year-old children. 


In Activity 28 (Section 6), we found strong evidence that the mean reading 
score was higher for children whose parents’ education had lasted longer. 
(Something less expected happened with respect to father’s occupation in 
Exercise 19.) 


Reservations about the data 


Probably the main reservation about the data is whether they can be 
considered a random sample from the relevant population. As was 
discussed in Activity 3 (Subsection 1.2), the data do not come from a 
formal random sample of British children aged 7 and 8 in 2004-2005, of 
the sort one might draw using a sampling frame and random numbers. But 
it might still be the case that the data can be treated as if they had been 
drawn in that way. How would a ‘real’ random sample differ from the BCS 
2004-2005 sample? The main difference was raised in Activity 3; all the 
children in our sample have at least one parent who is in the BCS survey, 
and therefore was born in a particular week in 1970. In a true random 
sample of British 7- and 8-year-old children, not every child would have a 
parent aged 34. 


There are other features of the sampling process that might lead to the 
sample of children being unrepresentative: 


e Children could be included only if the BCS 2004-2005 investigators 
had managed to trace their parents. People in the original BCS 
sample whose lifestyles involve moving around a lot may have been 
harder to trace, and therefore their children would be less likely to be 
in the sample. 


e There are missing data. This data may not be missing completely ‘at 
random’ — which might be OK, provided there is not too much of it — 
but its very missingness might be connected to the things you are 
trying to measure. (This is a common problem in real-world statistics.) 


For example, parents with less education might be more reluctant to 
say so in response to a survey, in which case children with such 
parents might be under-represented; worse, such parents might be 
more likely to not respond to the education question if they know 
their child is not reading especially well, and they don’t want to be 
‘blamed’ for this situation. 


e The data on parental education simply give the age at which one of 
the parents left full-time education, and say nothing about which 
parent it was. Also, nothing is said about any qualifications he or she 
gained, or about any part-time study. 


These reservations about the randomness/representativeness of the sample 
are probably less important than the reservation about the parents’ age, 
but they should not be forgotten. 


8 Conclusions and reservations 





Any other reservations? 


8.1 When to use the z-test 


The z-test can be applied in many situations, though it does have 
limitations. In this subsection the characteristics of the test are described 
so that you can recognise when it is appropriate. 


The sample size must be large 


It is unnecessary to know anything about the distribution of the 
population from which the sample is selected, because the test is based on 
the fact that the sampling distribution of the mean of a sample of size n is 
approximately normal, provided n is sufficiently large. 


As asimple rule of thumb, we assume that in the one-sample case, n should 
be at least 25, and in the two-sample case, n4 and ng should both be at 
least 25. If the sample size is less than 25, you should not apply the z-test. 
(If you believe that the population distribution is extremely skew — which 
has not been the case for any distribution in this unit — then it is safer 
only to use the z-test if the sample size is considerably greater than 25.) 


In Unit 10 you will meet another hypothesis test, the t-test, which you can 
apply under some circumstances when the sample size is less than 25. 


The sample values should consist of numerical measurements 


The z-test should be applied only to data which consist of numerical 
measurements. Length, weight, time, scores in a test and petrol 
consumption are all examples of such data. The z-test cannot be applied, 
for example, to data which might be coded, such as perhaps hair colour or 
disease type — with such data the concept of a population mean or a 
sample mean is not really meaningful. 


The samples should be unrelated 


This restriction applies only to the two-sample z-test. The samples from 
the two populations should be unrelated and so not consist of data 
collected in pairs, each pair coming from the same individual. 


All the hypothesis tests that we have performed in this unit were based on 
data that met the requirements for the one- or two-sample z-tests. Sample 
sizes were above 25 (substantially so for the BCS data), sample values were 
numerical measurements (often scores on a reading test), and the 
two-sample z-test was only ever applied to unrelated samples from two 
separate populations. 
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Unit 8. 
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8.2 Limitations in stating conclusions 


In stating conclusions from any hypothesis test, the following factors must 
be borne in mind. 


e A sampling error may have occurred. 


e The conclusions should match the population from which the sample 
was drawn. 


e The conclusions must not make causal statements which are not 
supported by the way the data arose. 


Let us look at each of these briefly. 
Sampling errors 


You should always bear in mind that a sampling error might have occurred; 
that is, the result of any hypothesis test might be due to sampling 
variation. Hypothesis tests do not provide proofs of the truth of either the 
null or alternative hypotheses. They just attempt to assess the evidence 
for or against the hypotheses. For example, if the null hypothesis is 
rejected, that means that there is evidence against the null hypothesis, but 
not that the null hypothesis is definitely wrong. However, with the BCS 
data, in most of the hypothesis tests where we rejected the null hypothesis, 
the test statistic came out much higher numerically than the critical values 
(it easily gave ‘strong evidence’), so with those tests it is unlikely — but 
still possible — that sampling error has led to erroneous conclusions. 


What can we say about the populations? 


A major difficulty with the BCS data is that it is not clear that these data 
can be treated as a random sample from any population. But they are 
clearly likely to be much more representative of the population of British 
children than of, say, Ugandan children. The stated conclusions were 
explicit in referring to British children and to the year, 2004—2005, in 
which the data were collected, although we should perhaps have referred to 
the population of British children, aged 7 or 8 in 2004-2005, who had at 
least one parent aged 34, as all these characteristics are common to the 
children in our sample. Nevertheless, it seems reasonably plausible that 
the data would still be representative of British children in some other year 
close to 2004-2005, say 2003 or 2007, since reading skills are unlikely to 
change very rapidly. But it would be a mistake to apply the conclusions 
directly to the population of British children in 1970, say, or 2013. 


What can we say about causal statements? 


Can we make any conclusions about what might have caused any 
differences for which we have evidence? For the BCS data, the answer is, 
essentially, ‘no’! Our conclusions are not worded in causal terms; for 
instance, we concluded (in Activity 28) that, for British children aged 7 
and 8 in 2004-2005, those whose parental education was beyond the age of 
16 had a higher mean reading score than did those whose parent left 
education earlier. Worded like that, the conclusion says nothing about how 
this difference arose; but there is a great temptation to suppose that the 
parent’s level of education caused the difference in mean reading score. 


Summary 


This causal conclusion goes beyond what the data tell us. Instead, there 
could well be one or more other factors that underly both a child’s reading 
ability and whether a parent of the child was educated past the age of 16. 
We just cannot tell about such things from these data, since they do not 
give us the appropriate information. 


Summary 


In terms of statistical methodology, you have been introduced to the most 
important distribution in statistics — the normal distribution — and you 
have learned to use the distribution in two hypothesis tests, the one-sample 
and two-sample z-tests. In this unit, the normal distribution arose out of 
consideration of the sampling distributions of the sample mean: regardless 
of the distribution of the original data, such sampling distributions were 
seen to become more and more normal-like as the sample size, n, increased. 
You then learned about the normal distribution itself. You saw the way in 
which it depends on two quantities, the population mean, u — controlling 
its location — and the population standard deviation, ø — controlling its 
spread. You also learned how any normal distribution can be related to a 
special normal distribution: the standard normal distribution with u = 0 
and ø = 1. You then found that the sampling distribution of the sample 
mean can be approximated by a normal distribution with mean u and 
standard deviation a/,/n, which is called the standard error of the mean. 


The z-test was first introduced in its one-sample form to address null and 
alternative hypotheses concerning the value of u. Its test statistic was 
developed in two forms, for g assumed to be known and, more usefully, for 
c unknown. You saw how the sampling distribution of the test statistic, 
and hence the critical values associated with the test, arose from the above 
results for the normal distribution. Having learned how to implement the 
one-sample z-test, you went on to learn how to adapt those ideas to 
produce the two-sample z-test; this is applicable to testing hypotheses 
concerning whether or not the means of two unrelated populations are 
equal. In each case, you applied what you learned about hypothesis testing 
in Unit 6 to interpret results in terms of the amount of evidence the data 
provide against the null hypothesis. 


What you learned about children’s reading abilities from the BCS survey 
has been summarised and discussed in Section 8. 
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Learning outcomes 


After you have worked through this unit, you should be able to: 


e appreciate the steps taken to make the unit’s original question, which 
is rather general, more specific 


e recall that the null and alternative hypotheses required for the z-test 
are expressed in terms of population means 


e recognise a bell-shaped distribution 


e appreciate that population distributions can have different shapes, 
some of which are normal 


e appreciate that, whatever the shape of the population distribution, for 
a large enough sample size the sampling distribution of the mean is 
nearly always approximately normal 


e appreciate the relationship between the location and spread of a 
normal distribution and its mean and standard deviation 


e appreciate that it makes sense to think of normal distributions in 
terms of the number of standard deviations of the variable away from 
its mean, and that we can therefore think of all normal distributions 
in terms of only one distribution: the standard normal distribution 


e apply the formula that transforms any variable x with a given normal 
distribution to the variable z with the standard normal distribution 


e understand what is meant by the standard error (of the mean) and the 
estimated standard error in one- and two-sample situations 


e write down the mean and standard deviation of the sampling 
distribution of the mean for samples of size n, given the population 
mean, u, and standard deviation, o 


e follow the reasoning behind the one-sample z-test and apply the test 
when o is assumed known 


e adapt and apply the one-sample z-test when o is unknown 


e understand and apply the two-sample z-test to analyse the difference 
between means 


e use Minitab to perform the one-sample z-test 


e be aware of questions to ask which might lead to reservations about 
the conclusions of a hypothesis test 


e be aware of some of the characteristics of the z-test, and recognise 
when it is necessary to exercise some caution in its use. 


Solutions to activities 


Solution to Activity 1 


There are many possible answers to this question. You can test a child’s 
reading ability by how well they read a coherent passage, recognise 
separate words, name letters, or pronounce separate words. Perhaps you 
have thought of other measures; or you may have thought in terms of a 
standard reading test of some kind. 


Solution to Activity 2 


Some of the factors you may have thought of are pre-school education, 
parents’ education, precise age of child, whether there are other children in 
the family, mental or physical disability, social deprivation, quality of 
teaching, method of teaching, school class size, and parent’s reading to the 
child at an early age. You may have been able to think of a different set of 
possibilities. 


Solution to Activity 3 


To be a random sample of exactly the sort you met in Unit 4, the sample 
would have had to be chosen by using random numbers to select children 
from a sampling frame of all 7- and 8-year-old children in the country. 
Clearly this was not done, so in this sense the sample is not random. 
However, you have previously met examples where a sample that was not 
chosen in this way was nevertheless considered to be representative in the 
same way that a formally selected random sample would be. In other 
words, the key question is not ‘Was this sample chosen using a sampling 
frame and random numbers?’, but ‘Was this sample chosen in such a way 
that it has the same properties as one chosen using a sampling frame and 
random numbers?’ 


The answer to the second question is not so clear in this case. It might 
seem reasonable to treat the original BCS sample of people born in a 
particular week in 1970 as being representative of the general population 
of people born in Great Britain around that time, in the same way that a 
random sample would be representative. It is perhaps less reasonable to 
treat their 7- and 8-year-old children as if they were a random sample 
from the population of all 7- and 8-year-old children in 2004-2005. This is 
because in a true random sample of children, the ages of the children’s 
parents would vary more — in this sample all the children have at least one 
parent born in a particular week in 1970. This might be quite a problem 
because the age and experience of their parents might well be linked to 
how a child’s reading develops. 
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Solution to Activity 4 


(a) The 7-year-old boys are identified in Table 1 by having a value of 1 in 
the third column (Gender — 1 denotes boy) and 1 in the fourth column 
(Coded age — 1 denotes 7 years old). There are six individuals in 
Table 1 that have 1 in each of the third and fourth columns. They 
have reading scores 


106 110 134 25 172 160. 
The sample size is n = 6. 


(b) To calculate 7, 
S "x = 106 +110 + 134 + 25 + 172 + 160 = 707, 


and so 


ae La 
n 6 


Using Method 2 from Unit 3 (Subsection 3.1) to calculate s, 


Ye - 7)? — yr = 2: 


707)? 


~ 13 792.833. 


This means that the variance is 
X (z— T)? _ 13 792.833 
n— 1 E 5 
~ 2758.5667. 
So 


s = vy variance = v 2758.5667 
~ 52.5. 
Solution to Activity 5 


The proportion of students on MS221 in the presentation in question 
achieving 75 marks is the actual number of students receiving 75 marks 
(21) divided by the total number of students sitting the exam (1234). 
That is, 


co 0.0170 
1234 l 
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Solution to Activity 6 














(a) (i) Sample mean = 2 " Ze 2 = 2h. 
(ii) Sample mean = i $ Com = = 11; 
(iii) Sample mean = 2 > 2m mr = 58.5. 
(iv) Sample mean = a 5 au au = 58.5. 


(b) The sample means of samples of size 2 are either integers (as in (i) 
and (ii) in part (a)) or else ‘half-integers’, that is, values of the form 
‘an integer plus a half’ (as in (iii) and (iv) of part (a)). 


Solution to Activity 7 


The distribution of sample means of size 2 shown in Figure 3 is much 
smoother and less jagged than the distribution of the population data 
shown in Figure 2. The distribution of sample means of size 2 is also fairly 
symmetric, about a maximal value at around 70. However, there are 
slightly more sample means less than 70 than greater than 70, meaning 
that the distribution is slightly left-skew (see Subsection 5.2 of Unit 1). 
You might also note that the distribution fades away to almost nothing — 
corresponding to very rare sample mean values — at about 10 or so. 


Solution to Activity 8 
10+20+45 75 


(a) Sample mean = 3 == 25. 
(b) Sample mean = —— = = ~ 46.3. 
(c) Sample mean = Hitin = = = 62. 
(d) Sample mean = AE e 2 = ~ 62.7. 


Solution to Activity 9 


The distribution of sample means of size 3 shown in Figure 4 is much 
smoother than the distribution of sample means of size 2 shown in 

Figure 3 — it is made up of many more very short lines whose overall effect 
is closer to a smooth curve. The sampling distribution in Figure 4 is a 
little more compressed from side to side than that in Figure 3; that is, it 
has a smaller spread. The sampling distribution in Figure 4 is perhaps 
even closer to symmetric than the one in Figure 3. The maximum value 
about which the sampling distribution is approximately symmetric is, 
however, at approximately the same place as the maximum in Figure 3 — 
that is, at about, or a little under, 70. Finally, corresponding to its smaller 
spread, the distribution in Figure 4 fades away to almost nothing at about 
20 or so (and just below 100). 
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Solution to Activity 10 


The spread of the sampling distribution in Figure 5 is a little smaller again 
than the spread of the sampling distribution in Figure 4. It is also the case 
that any skewness apparent in Figure 4 is no longer apparent in Figure 5: 
this time, the distribution is symmetric, falling away smoothly on either 
side of a maximum value a little way below 70. But aside from the change 
in spread, the sampling distribution in Figure 5 is rather similar to the 
sampling distribution in Figure 4; in particular, the maximum is at 
approximately the same place in the two figures, while in both cases the 
sampling distributions fall away from the maximum, first more rapidly and 
then more slowly as they ‘level out’ a long way from the maximum. 


Solution to Activity 11 


As the sample size n increases, the sampling distributions, which all have 
the same symmetric shape, rise more and more sharply to a mode (at a 
little below 70, it seems). Also, the distributions become more and more 
compressed (i.e. the spread decreases as the sample size increases). 


Solution to Activity 12 


For n = 2, the sampling distribution of the mean is right-skew, but a little 
less so than the population distribution. As the sample size n increases, 
the sampling distributions again become more symmetric and bell-shaped. 
The distributions also become more and more peaked and compressed 
about the mode (at about £500). 


Solution to Activity 13 


The centre of this normal distribution is located at the value 1, so, as in 
Figure 11(b), this means that u = 1. The distribution also appears to have 
the same spread as the normal distribution in Figure 12(c), so ø = 2. To 
confirm these claims, notice that the x-axis labels on Figure 11(b) have 1 
added to them (when u = 1) compared with the corresponding labels on 
Figure 11(a) (when u = 0); similarly, the z-axis labels on Figure 13 have 1 
added to them (when u = 1) compared with the corresponding labels on 
Figure 12(c) (when u = 0). 


Don’t worry if you didn’t get this activity right. There is much more on 
changing both u and ø in the normal distribution in the Computer Book 
and Subsections 3.2 and 3.3 to follow. 


Solution to Activity 14 


(a) The mode of this normal distribution occurs at about x = 10. So 
u ~ 10. Almost all the distribution is contained between x = 4 and 
x = 16 (ie. within 10+ 6). So 30 ~ 6 and o ~ 2. That is, the normal 
distribution plotted in Figure 17 is approximately the normal 
distribution with mean u = 10 and standard deviation o = 2. 





(b) The mode of this normal distribution occurs at about x = 100. So 
u ~ 100. Almost all the distribution is contained between x = 40 and 
x = 160 (i.e. within 100 + 60). So 30 œ 60 and o ~ 20. 





That is, the normal distribution plotted in Figure 18 is approximately 
the normal distribution with mean u = 100 and standard deviation 
o = 20. 


(c) The mode of this normal distribution occurs at about z = 1. So u œ 1. 


Almost all the distribution is contained between x = 0.7 and x = 1.3 
(i.e. within 1+0.3). So 30 ~ 0.3 and ø œ 0.1. That is, the normal 
distribution plotted in Figure 19 is approximately the normal 
distribution with mean u = 1 and standard deviation o = 0.1. 





Solution to Activity 15 


You should have obtained something like the sketches below, although, 
since you may have used different scales, yours could look a bit different. 
The important thing is that the information on your horizontal axes 
should match those in the figures. 


e The following normal distribution is centred at u = 1000 and has just 
about all the distribution contained within 
1000 + (3 x 100) = 1000 + 300, i.e. between 700 and 1300. 














500 600 700 800 900 1000 1100 1200 1300 1400 1500 x 


The normal distribution with u = 1000, ø = 100 


e The following normal distribution is centred at u = 2 and has almost 
all the distribution contained within 2 + (3 x 0.25) = 2+0.75, 
i.e. between 1.25 and 2.75. 














The normal distribution with u = 2, 0 = 0.25 
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Solution to Activity 16 
(a) Here u = 10 and o = 2, so 








x — 10 
= 
(b) Here u = 100 and o = 20, so 
x — 100 
~~ 20° 
(c) Here u = 1 and o = 0.1, so 
t= 1 
a= 
If you prefer, you could equivalently write this as 
C=. 
= 1/10 = 10(x — 1) 


Solution to Activity 17 


(a) The appropriate formula is 


h-u 
z = —, 
o 
where u = 1.75 and o = 0.07. Hence 
_h-1.75 
m 


(b) When h = 1.96, 
= 1.96 — 1.75 0.21 _ 
E 0.07 0.07 — 


So a height of 1.96 metres is 3 standard deviations above the mean 
height of 1.75 metres. 


When h = 1.61, 
_ 161-175 _-014_ 
Z= — 0o07 O07. | 


So a height of 1.61 metres is 2 standard deviations below the mean 
height of 1.75 metres. 
When h = 1.785, 
= 1.785— 1.75 0.035 _ 
© 007 007 


So a height of 1.785 metres is 0.5 standard deviations above the mean 
height of 1.75 metres. 


0.5. 


You can check the picture of the distribution in Figure 14 
(Subsection 3.2) to see if each of the values of h in this activity is the 
appropriate z standard deviations away from the mean. 


Solutions to activities 


Solution to Activity 18 


(a) In each case the sampling distribution is symmetric about a mode at 
about 66 marks. So the means of the sampling distributions appear to 
be the same as the population mean u = 66 marks. 


(b) The sampling distributions all look symmetric with a mode at about 
£491 or so. So again the mean of each of the sampling distributions 
appears to be the same as the population mean p = £491. 

Solution to Activity 19 


(a) The standard deviation of the sampling distribution of mean exam 
marks decreases (i.e. the distributions become more compressed) as 
the sample size n increases. 


(b) The standard deviation of the sampling distribution of mean 
employees’ earnings also decreases (i.e. the distributions become more 
compressed) as the sample size n increases. 

Solution to Activity 20 

(a) When n = 25, 

o 22 | 
yn v25 

(b) When n = 50, 


4.4, 


o 22 311 
vn VIO 
(c) When n = 100, 


o 22 99 
vn vioo | 


Solution to Activity 21 


When n = 25 and o = 0.01, 


o 0.01 0.01 
—— = 0.002. 


vn v5 5 
It follows that the sampling distribution of the mean for samples of 
25 ball bearings from this manufacturer is approximately normal with 
mean u = 2mm and standard deviation o/yn = 0.002 mm. 








Solution to Activity 22 


The value of z is 
T—A 112 — 120 
Z = —————— = — 
SE 15/v100 


Solution to Activity 23 


~ —5.33. 


If Hp is rejected at the 5% significance level but not at the 1% significance 
level, then z lies in the critical region shown in Figure 32 but not in the 
critical region shown in Figure 33, that is 1.96 < z < 2.58 or 

—2.58 < z < —1.96. This is shown in the following figure. 
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O-+ ------------------------ 






z must lie in one 
of these intervals 






Sketch of standard normal distribution with possible values of z indicated 


Solution to Activity 24 


(a) 


The null and alternative hypotheses are 
Ao: u = 26.1 
Hı: u Æ 26.1, 
where u is the population mean set-up time of the new method. 


A = 26.1, as this is the value of u under Hp. The sample values are 
T = 20.9 and n = 53. 


The test statistic is 
Z-A 20.9— 26.1 
SE 12.3//53 


As —3.08 is less than —1.96 and —2.58, the null hypothesis is rejected 
at both the 5% significance level and the 1% significance level. 


~ —3.08. 


= 





There is strong evidence against Ho. Thus there is strong evidence 
that the mean set-up time under the new method differs from that 
under the old method — there is strong evidence that the new method 
is faster. 


Solution to Activity 25 


The appropriate null and alternative hypotheses are 
Ho: p = 116 
Hı: u Æ 116, 


where u is the population mean reading score of all British 8-year-old 
children in 2004-2005. 


As the sample size, 283, is much greater than 25, it is appropriate to apply 
the z-test. We have 


A=116, F=126.92, n=283, s=27.711. 


The test statistic is 
T-A T—A 126.92 — 116 


zi pee i 
ESE  s/yn  27.711/v283 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 

6.63 > 2.58, we can reject the null hypothesis at the 1% significance level 
and conclude that there is strong evidence that the mean reading score of 
8-year-olds in 2004-2005 was not equal to 116. 


63. 


This might have been a little surprising if we had not seen a similar result 
for 7-year-old children in Example 7. 


Solution to Activity 26 

The null and alternative hypotheses are 
Ho: u = 381.50 
Ay: u # 381.50, 


where u is the population mean weekly wage (in £) of female local 
government clerical officers and assistants in 2011. 


As the sample size, 810, is greater than 25, it is appropriate to apply the 
z-test. We have 


A= 381.5, £=373.4, n=810, s=138.2. 


The test statistic is 
T-A T-A _ 373.4— 381.5 7 


~ ESE  s/yn  138.2/vV810 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
—1.96 < —1.67 < 1.96, the null hypothesis is not rejected at the 5% 
significance level. There is little evidence that the mean weekly wage of 
female local government clerical officers and assistants differed from the 
overall mean weekly wage of female employees in 2011. 


1.67 


Solution to Activity 27 
The null and alternative hypotheses are 
Ho: Hg = Hp 
Hı: Hg F H, 
where jg and up are the population mean reading scores for 8-year-old 
girls and boys, respectively. We have: 
Tg = 127.49, Tp = 126.38, ng = 138, np = 145, 
Sg = 25.064, sp = 29.927. 


Both ng = 138 and np = 145 are greater than 25, so we can assume that 
the z-test is applicable. 
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The estimated standard error is 








ae 52 A s? / 25.0642 n 29.927? 3 ong 
Ving m 138 145 0 °°”? 


and the test statistic is 
Tg — Tp 127.49 — 126.38 
= YS A 0.854. 
= “RSE 3.276 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
—1.96 < 0.34 < 1.96, we cannot reject the null hypothesis at the 5% 

significance level. There is no reason to doubt that the mean reading 
scores of 8-year-old girls and boys were the same. 





Solution to Activity 28 


Let ‘E’ denote quantities relating to children whose parental education 
ended by age 16, and ‘C’ denote quantities relating to children whose 
parental education continued after age 16. The null and alternative 
hypotheses are 


Ho: uc = Hg 
Ay: uc F Mp; 


where uç and up are the population mean reading scores of interest. 
We have: 


Tg = 116.12, Fo=123.15, ne =389, no = 199, 
SE = 28.775, so = 24.603. 


Both nc = 199 and ng = 389 are greater than 25, so we can assume that 
the z-test is applicable. 


The estimated standard error is 


s2 s2 24.6032 28.7752 
ESE = ,/ Æ 4 E — ~ 2.274 
\ no i ng 199 ` 389 i 


and the test statistic is 
Te — TE 123.15 — 116.12 
ee — x 3.09. 
“= ESE 2.274 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 

3.09 > 2.58, we can reject Ho at the 1% significance level. There is strong 
evidence that the mean reading score of children whose parental education 
continued after age 16 differs from the mean reading score of children 
whose parental education ended by age 16. There is strong evidence that 
the children of parents who stayed longer in full-time education did better 
than those of parents who left education earlier. 
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Solutions to exercises 


Solution to Exercise 1 


(a) 


The 8-year-old children are identified in Table 1 by having a value of 2 
in the fourth column (Coded age ‘2’ denotes 8 years old). There are 
four individuals in Table 1 that have 2 in the fourth column. They 
have reading scores 


118 115 56 136. 
The sample size is n = 4. 
To calculate 7, 
X x = 118 + 115 + 56 + 136 = 425, 


and so 

Ya 425 
n 4 
To calculate s, 


Se - 7)? = yr = 22) 


4252 





T 





= 48781 — 
= 3624.75, 





which means the variance is 
X (z — T) _ 3624.75 
cae ar 


n— 1 


= 1208.25. 


So 
s = vV variance = vy 1208.25 ~ 34.8. 


Solution to Exercise 2 


(a) 


The children of interest in this exercise are identified in Table 1 by 
having a value of 1 in the fifth column (Parental education ‘1’ denotes 
finished aged 16 or less) and a value of 1 in the sixth column (Father’s 
occupation ‘1’ denotes managerial, technical, professional and skilled 
non-manual occupations). There are seven individuals in Table 1 that 
have 1 in both the fifth and sixth columns. They have reading scores 


123 110 134 110 172 136 160. 
The sample size is n = 7. 
To calculate 7, 


S x = 123 +110 + 134 + 110 + 172 + 136 + 160 = 945, 


and so 








4 
2r 945 135. 
n T 


T 
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To calculate s, 


Se - 7)? = y = az 


945? 
= 130965 — 


= 3390, 





which means the variance is 
Yi(e—z)? — 3390 _ 
n— 1 © 6 


565. 


So 
s = vV variance = V 565 ~ 23.8. 


Solution to Exercise 3 
Suitable null and alternative hypotheses are 


A: For British children aged 8 in 2004-2005, the mean reading score 
for girls was equal to the mean reading score for boys 


Hı: For British children aged 8 in 2004-2005, the mean reading score 
for girls was not equal to the mean reading score for boys. 
Solution to Exercise 4 


(a) For Population A, the six different samples of size 2 with their sample 
means are listed below: 




















Sample: 10 20; sample mean = = 5 a = 7 15 
Sample: 10 30; sample mean = = 5 A = = = 20 
Sample: 10 40; sample mean = A 7 L = 2 = 25 
Sample: 20 30; sample mean = a 5 cas = ~ = 25 
Sample: 20 40; sample mean = a 5 a = a = 30 
Sample: 30 40; sample mean = 2 5 X = 7 = 35 


The sample means are plotted along the horizontal axis in the 
following figure. 





(J 
e ® @ e e 
15 20 25 30 35 


Plot of values of sample means from Population A 
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(b) For Population B, the six different samples of size 2 with their sample 
means are listed below: 

















Sample: 10 38; sample mean = 2 $ 2 = = = 24 
Sample: 10 39; sample mean = = 7 = = 2 = 24.5 
Sample: 10 40; sample mean = a - 2 = = = 25 
Sample: 38 39; sample mean = 2 5 a = a = 38.5 
Sample: 38 40; sample mean = 2 $ = = l = 39 
Sample: 39 40; sample mean = a = a = 39.5 


The sample means are plotted along the horizontal axis in the 
following figure. 


54 95 96 27 28 99 30 31 32 33 34 35 36 37 38 39 40 





Plot of values of sample means from Population B 


(c) The points in the graph in part (a) are symmetrically distributed 
around a central mode, while the points in the graph in part (b) are 
split into two groups some distance apart. Hence the graph in part (a) 
seems more bell-shaped than the graph in part (b). This happens 
because the points in Population A are more symmetric — and more 
evenly spread out — than the points in Population B, which consist of 
three points close together (38, 39 and 40) and another far away (10). 


Solution to Exercise 5 


The distribution of reading scores — ‘sample means’ when n = 1 — is very 
jagged, but if you squint your eyes you get an impression of a fairly 
symmetric distribution with perhaps a slight preponderance of low, as 
opposed to high, values. The distribution of sample means of size n = 2 is 
smoother, though still with some jaggedness towards its right-hand side, 
fairly close to symmetric but with a little bit of left skewness. When n = 3 
the distribution is smoother again, and any lack of symmetry is pretty 
small. It is also clear that the vertical scale of the sampling distribution of 
the mean when n = 3 is larger than the vertical scale of the distribution of 
the data (n = 1). By the time n = 10, the distribution of sample means is 
very smooth, symmetric, bell-shaped/normal-like and with a larger 
vertical scale still. 


So, again, we see that even though the population distribution is not 
especially normal-like, as the sample size n increases, the sampling 
distribution of the mean quite quickly becomes much more normal-like. 
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Solution to Exercise 6 


The mode of this normal distribution occurs at about x = 10. So u œ 10. 
Almost all the distribution is contained between x = 0 and x = 20 

(i.e. within 10+ 10). So 30 œ 10 and o ~ 3.33. That is, the normal 
distribution plotted in Figure 28 is approximately the normal distribution 
with mean u = 10 and standard deviation o = 3.33. 





Solution to Exercise 7 


The mode of this normal distribution occurs at about x = —5. So u œ —5. 
Almost all the distribution is contained between x = —12 and x = 2 

(i.e. within —5 +7). So 30 ~7 and o ~ 2.33. That is, the normal 
distribution plotted in Figure 29 is approximately the normal distribution 
with mean u = —5 and standard deviation ø = 2.33. 





Solution to Exercise 8 


You should have obtained something like the sketch in the figure below, 
although, since you may have used different scales, yours could look a bit 
different. This normal distribution is centred at u = —1 and has just 
about all the distribution contained within —1 + (3 x 1) = —1 3, 

i.e. between —4 and 2. 
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A normal distribution with u = —1, o = 1 


Solution to Exercise 9 


You should have obtained something like the sketch in the figure below, 
although, since you may have used different scales, yours could look a bit 
different. This normal distribution is centred at u = 4 and has just about 
all the distribution contained within 4+ (3 x 4) = 4 +12, ie. between —8 
and 16. 
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A normal distribution with u = 4,0 = 4 


Solution to Exercise 10 
(a) Here u = 6 and o = 3.3, so 
x—6 
33° 
(b) Here u = —6 and ø = 2, so 
= x — (—6) _ c+6 
2 2 





~— 





Solution to Exercise 11 


The appropriate formula is 








_ B= 2 
z= 
When z = 3, 
3-2 1 
—_—_ = — d 
10 10 j 


Solution to Exercise 12 
The appropriate formula is 
_ (1) z+1 
0.5 1/2 
When x = 0, 
g=2(0+1) =2. 


Zz 





2(x +1). 


Solution to Exercise 13 
(a) When n = 9, 
o 283 283 


— = — = — 2 94.3. 
i/o 3 
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(b) When n = 25, 
283 283 











o 
56.6. 
vn v25 5 
(c) When n = 100, 
2 
T _ 23 _ 283 _ ogg 
yn 100 10 
Solution to Exercise 14 
(a) When n = 4, 
v _36_ 36 45 
Jn «4.2 
(b) When n = 19, 
o 3.6 
— = —— ~ 0.83. 
yn vig 
(c) When n = 300, 
o 3.6 
— = — ~ 0.21. 
vn 300 


Solution to Exercise 15 
When n = 40 and o = 0.01, 


o 0.01 
— = —— ~ 0.0016. 
yn  ~40 


It follows that the sampling distribution of the mean for samples of 40 
one-litre bottles of water from this manufacturer is approximately normal 
with mean „u = 1.01 litres and standard deviation o/\/n = 0.0016 litres. 


Solution to Exercise 16 
The appropriate null and alternative hypotheses are 
Ho: u = 96 
Ay: u #96, 
where u is the population mean reading score of all British 7-year-old girls 


in 2004-2005. 


As the sample size, n = 190, is much greater than 25, it is appropriate to 
apply the z-test. We have 


A=96, F=113.42, n=190, s= 25.464. 


The test statistic is 
T-A 7-A 113.42 — 96 9.43 
= = = Lae i 
ESE s/n 25.464/./190 
The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
9.43 > 2.58, we can reject the null hypothesis at the 1% significance level. 





Hence there is strong evidence that the mean reading score of 7-year-old 
girls in 2004-2005 is not equal to 96. 


This result corresponds to the similar result observed for all 7-year-old 
children (not just girls) in Example 7. 


Solution to Exercise 17 

The null and alternative hypotheses are 
Ho: u = 80 
Hı: u #80, 


where u is the mean weight (in kg) of this breed of pig when fed the 
special diet. 


As the sample size, 533, is greater than 25, it is appropriate to apply the 
z-test. We have 


A=80, £=81.92, n= 533, s= 15.65. 


The test statistic is 

Z-A T-A 8192—80 ~ 2.83 

ESE  s/yn  15.65/vV533 ` 

The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
2.83 > 2.58, the null hypothesis is rejected at the 1% significance level. 
There is strong evidence that the mean weight of this breed of pig when 
fed the special diet is not equal to 80 kg. There is strong evidence that the 
mean weight is higher for the special diet. 


Solution to Exercise 18 

(a) The null and alternative hypotheses are 
Ao: w=4 
Hı: p #4, 


where u is the population mean drying time in hours of the 
manufacturers’ paint. As the sample size, n = 40, is greater than 25, it 
is appropriate to apply the z-test. We have 


A=4, 7=3.80, n=40, s= 0.55. 


The test statistic is 

T-A T-A 3.80 — 4 

ESE s/yn 0.55/40 
The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
—2.58 < —2.30 < —1.96, we can reject the null hypothesis at the 5%, 
though not at the 1%, significance level. We conclude that there is 
moderate evidence that the drying time given by the consumer 
magazine is incorrect. The manufacturers’ paint appears to dry more 
quickly than the magazine claimed. 





(b) You might think that such ‘marginal’ (moderate) evidence is not 
enough to conclude that the manufacturers’ paint dries more quickly 
than the consumer magazine claimed. 
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The time at which paint is declared ‘dry’ is not well-defined: different 
customers might measure drying time differently or have different 
ideas about what ‘dry’ means. 


Even if the measures are reliable and the test result is correct, 0.20 
hours or 12 minutes is not a very large reduction. Most customers 
would not consider this an important difference. 


You may have thought of other reservations. 


Solution to Exercise 19 


The null and alternative hypotheses are 


Ho: fy = Hy 
Hi: Hı F Ho, 


where u; and u are the population mean reading scores of interest. Here 
and below, ‘1’ denotes quantities relating to children with father’s 
occupation coded 1 and ‘2’ denotes quantities relating to children with 
father’s occupation coded 2. The summary statistics are: 


zı = 120.55, Z=117.17, nm, =316, no = 203, 
sı = 24.221, s2 = 30.085. 


Both nı = 316 and n2 = 203 are greater than 25, so we can assume that 
the z-test is applicable. 


The estimated standard error is 


ls? 32 24.2212 30.0852 
ESE = 4| SS ~ 2.513, 


and the test statistic is 
Tı — T2 120.55 — 117.17 


= pp EO ee 
” = “BSE 2.513 


(You might have got 1.34, correct to two decimal places, if calculating z all 
in one go. Such a difference doesn’t matter.) 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
—1.96 < 1.35 < 1.96, we cannot reject Hg at the 5% significance level. 
There is little evidence that the mean reading score of children whose 
father had an occupation coded 1 differed from the mean reading score of 
children whose father had an occupation coded 2. This goes against 
conventional wisdom, at least from other contexts. 











Solution to Exercise 20 


Let ‘A’ denote quantities relating to breast-fed babies and ‘B’ denote 
quantities relating to bottle-fed babies. The null and alternative 
hypotheses are 


Ho: a = HB 
Hı: HA -Á Mp; 


where u4 and upg are the population mean serum calcium levels of 
interest. The summary statistics are: 


Za=2.45, Fp=2.30, na=64, np = 169, 
sa = 0.292, sp = 0.274. 


Both n4 = 64 and ng = 169 are greater than 25, so we can assume that 
the z-test is applicable. 


The estimated standard error is 


|s% 52 0.2922 0.2742 
ESE=,/4+482= ~ 0.042 
NA a NB 64 > 169 i 


and the test statistic is 
_Ea—Zp _ 2.45 —2.30 


= aon oa 


(You might have got 3.56.) 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 

3.57 > 2.58, we can reject the null hypothesis at the 1% significance level. 
There is strong evidence that the mean serum calcium level of week-old 
babies was different depending on whether they were breast-fed or 
bottle-fed. The evidence is that it was higher in those who were breast-fed. 








Solution to Exercise 21 


Let ‘R’ denote quantities relating to children from rural areas and ‘U’ 
denote quantities relating to children from urban areas. The null and 
alternative hypotheses are 


Ho: ug = by 
Hı: ug F by, 


where upg and py are the population mean peak flow rates of interest (in 
litres per minute). The summary statistics are: 


Ty = 226, TR=231, ny =485, nr = 637, 
su = 92, sp =953. 


Both ny = 485 and ng = 637 are greater than 25, so we can assume that 
the z-test is applicable. 


The estimated standard error is 


s2 s2 532 522 
ESE = 4| Æ + Æ =4/—_ + = ~x 3.160 
NR, t nuy 637 t 485 f 
and the test statistic is 
Zr— y 231 — 226 
= 858, 
” = ESE 3.160 


The critical values are 1.96, —1.96 (5%) and 2.58, —2.58 (1%). Since 
—1.96 < 1.58 < 1.96, we cannot reject the null hypothesis at the 5% 
significance level. Thus, there is little evidence to suggest that the mean 
peak flow rate differs between children who live in rural areas and those 
who live in urban areas. 
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